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Abstract. Schaefer's theorem is a complexity classification result for 
so-called Boolean constraint satisfaction problems: it states that every 
Boolean constraint satisfaction problem is either contained in one out 
of six classes and can be solved in polynomial time, or is NP-complete. 

We present an analog of this dichotomy result for the propositional 
logic of graphs instead of Boolean logic. In this generalization of Schae- 
fer's result, the input consists of a set W of variables and a conjunction 
$ of statements ("constraints") about these variables in the language 
of graphs, where each statement is taken from a fixed finite set ^ of 
allowed quantifier- free first-order formulas; the question is whether $ is 
satisfiable in a graph. 

We prove that either ^ is contained in one out of 17 classes of graph 
formulas and the corresponding problem can be solved in polynomial 
time, or the problem is NP-complete. This is achieved by a universal- 
algebraic approach, which in turn allows us to use structural Ramsey 
theory. To apply the universal- algebraic approach, we formulate the 
computational problems under consideration as constraint satisfaction 
problems (CSPs) whose templates are first-order definable in the count- 
ably infinite random graph. Our method to classify the computational 
complexity of those CSPs is based on a Ramsey-theoretic analysis of 
functions acting on the random graph, and we develop general tools 
suitable for such an analysis which are of independent mathematical 
interest. 

1. Motivation and the result 

In an influential paper in 1978, Schaefer [25j proved a complexity classifl- 
cation for systematic restrictions of the Boolean satisflability problem. The 
way how he restricts the Boolean satisflability problem turned out to be 
very fruitful when restricting other computational problems in theoretical 
computer science, and can be presented as follows. 

Let = {'01, . . . , ipn} be a flnite set of propositional (Boolean) formulas. 

Boolean-SAT(^^) 

INSTANCE: Given a flnite set of variables W and a propositional formula 
of the form $ = 0i A • • • A 0/ where each 0^ for 1 < i < / is obtained from 
one of the formulas '0 in by substituting the variables of ^ by variables 
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from W. 

QUESTION: Is there a satisfying Boolean assignment to the variables of W 
(equivalently, those of $)? 

The computational complexity of this problem clearly depends on the 
set and is monotone in the sense that if C then solving Boolean- 
SAT(^^') is at least as hard as solving Boolean-SAT(^^). Schaefer's theorem 
states that Boolean- SAT (^^) can be solved in polynomial time if and only if 

is a subset of one of six sets of Boolean formulas (called 0-valid, 1 -valid, 
Horn, dual-Horn, affine, and bijunctive)^ and is NP-complete otherwise. 
We remark that Schaefer's theorem is usually formulated as a classification 
result of Boolean constraint satisfaction problems, but the formulation given 
above is easily seen to be equivalent. 

We prove a similar classification result, but for the propositional logic 
of graphs instead for propositional Boolean logic. More precisely, let E 
be a relation symbol which denotes an antireflexive and symmetric binary 
relation and hence stands for the edge relation of a (simple, undirected) 
graph. We consider formulas that are constructed from atomic formulas of 
the form E{x, y) and x = y by the usual Boolean connectives (negation, 
conjunction, disjunction), and call formulas of this form graph formulas. A 
graph formula $(xi, . . . , Xm) is satisfiable if there exists a graph H and an 
m-tuple a of elements in H such that $(a) holds in H. 

The problem of deciding whether a given graph formula is satisfiable can 
be very difficult. For example, the question whether or not the Ramsey 
number i?(5, 5) is larger than 43 (which is an open problem, see e.g. [18j) 
can be easily formulated in terms of satisfiability of a single graph formula. 
Recall that i?(5, 5) is the least number k such that every graph with at least 
k vertices either contains a clique of size 5 or an independent set of size 5. So 
the question whether or not i?(5, 5) is greater than 43 can be formulated as 
the question of satisfiability of a graph formula using 43 variables xi, . . . , X43 
on which one imposes the following constraints: all variables denote different 
vertices in the graph, and for every five-element subset of the variables we 
add a constraint that forbids that the variables of this subset form a clique 
or an independent set; this can clearly be stated as a graph formula. If 
this graph formula is satisfiable, then this implies that i?(5, 5) < 43, and 
otherwise i?(5, 5) > 43. 

Let = {'01, . . . , tpn} be a finite set of graph formulas. Then gives rise 
to the following computational problem. 

Graph-SAT(*) 

INSTANCE: Given a set of variables W and a graph formula of the form 
$ = 01 A • • • A 0^ where each 0^ for 1 < z < / is obtained from one of the 
formulas 0^ in by substituting the variables from ^ by variables from W. 
QUESTION: Is $ satisfiable? 
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As an example, let be the set that just contains the formula 



(1) 



{E{x,y)A^E{y,z)A^E{x,z)) 

V {^E(x,y)AE(y,z)A^E{x,z)) 

V {^E(x,y)A^E{y,z)AE{x,z)) 



Then Graph-SAT(^^) is the problem of deciding whether there exists a graph 
such that certain prescribed subsets of its vertex set of cardinality at most 
three induce subgraphs with exactly one edge. This problem is NP-complete 
(the curious reader can check this by means of our classification in Theo- 



In this example, Graph-SAT(^^) is the problem of deciding whether there 
exists a graph such that certain prescribed subsets of its vertex set of car- 
dinality at most three induce either a subgraph with exactly one edge, or a 
complete triangle. This problem turns out to be tractable. 

It is obvious that the problem Graph-SAT(^^) is for all contained in 
NP. The goal of this paper is to prove the following dichotomy result. 

Theorem 1. For all "if, the problem Graph-SAT(^^) is either NP-complete 
or in P. Moreover, the problem to decide for given whether Graph-SAT(^^) 
is NP-complete or in P is decidable. 

One of the main contributions of the paper is the general method of 
combining concepts from universal algebra and model theory, which allows 
us to use deep results from Ramsey theory to obtain the classification result. 



We establish our result by translating Graph-SAT problems into con- 
straint satisfaction problems (CSPs) with infinite domains. More specifi- 
cally, for every set of formulas we present a relational structure such 
that Graph-SAT(^^) is equivalent to CSP(r^); in a certain sense, Graph- 
SAT(^^) and CSP(r^) are one and the same problem. The relational struc- 
ture has a first-order definition in the random graph G, i.e., the (up to 
isomorphism) unique countably infinite universal homogeneous graph. The 
random graph belongs to one of the most fundamental cj-categorical struc- 
tures, and is an important structure in model theory that appears also in 
many other areas of mathematics (see [13j). This perspective allows us to 
use the so-called universal- algebraic approach, and in particular polymor- 
phisms to classify the computational complexity of Graph-SAT problems. 
In contrast to the universal- algebraic approach for finite domain constraint 
satisfaction, our proof relies crucially on strong results from structural Ram- 
sey theory; we use such results to find regular patterns in the behavior of 
polymorphisms of structures with a first-order definition in G, which in turn 
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V {^E{x,y)A^E{y,z)AE{x,z)) 

V {E{x,y)AE{y,z)AE{x,z)) . 



2. Discussion of our strategy 
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allows US to find analogies with polymorphisms of structures on a Boolean 
domain. 

We call structures with a first-order definition in G reducts of G. While 
the standard definition of a reduct of a relational structure A is a structure 
on the same domain obtained by forgetting some relations of A, a reduct 
of A in our sense is really a reduct of the expansion of A by all first-order 
definable relations. It turns out that there is one class of reducts V for which 
CSP(r) is in P for trivial reasons; further, there are 16 classes of reducts 
r for which CSP(r) (and the corresponding Graph-SAT problems) can be 
solved by non-trivial algorithms in polynomial time. 

The presented algorithms are novel combinations of infinite domain con- 
straint satisfaction techniques (such as used in [16^ 6^ 2J) and reductions 
to the tractable cases of Schaefer's theorem. Reductions of infinite domain 
CSPs in artificial intelligence (e.g., in temporal and spatial reasoning [17j) 
to finite domain CSPs (where typically the domain consists of the elements 
of a so-called 'relation algebra') have been considered in the more applied 
artificial intelligence literature [29] . Our results shed some light on the ques- 
tion when such techniques can even lead to polynomial-time algorithms for 
CSPs. 

The global classification strategy of the present paper is similar in spirit 
to the one of a recent result in on CSPs of structures which are first-order 
definable in (Q; <). But while in [5] the proof might still have appeared to 
be very specific to constraint satisfaction over linear orders, with the present 
paper we demonstrate that in principle such a strategy can be used for any 
class C of computational problems that satisfies the following: 

• All problems in C can be formulated as a CSP of a structure which 
is first-order definable in a single structure A; 

• A is homogeneous in a finite language and the class of finite sub- 
structures of A has the Ramsey property (as in [23j). 

We remark that in our case, this A is the ordered random graph (roughly 
the random graph equipped with the order of the rationals in a random way 
- confer Section [7| . 

While in [5] , the classical theorem of Ramsey and its product version were 
sufficient, the Ramsey theorems used in the present paper are deeper and 
considerably more difficult to prove [24^ Jj . 

3. Tools from universal algebra and model theory 

In this section we develop in detail the tools from universal algebra and 
model theory needed for our approach. We start by translating the problem 
Graph-SAT(^^) into a constraint satisfaction problem for a reduct of G. 

We denote the random graph by G = [V] E). The graph G is determined 
up to isomorphism by the two properties of being homogeneous (i.e., any 
isomorphism between two finite induced subgraphs of G can be extended to 
an automorphism of G), and universal (i.e., G contains all countable graphs 
as induced subgraphs). Moreover, it satisfies the extension property^ which 
often is useful in combinatorial arguments: for all disjoint finite U^U' 
there exists v such that v is adjacent in G to all members of U and to 



SCHAEFER'S THEOREM FOR GRAPHS 



5 



none in U' . Up to isomorphism, there exists only one unique countably infi- 
nite graph which has this extension property, and hence the property can be 
used as an alternative definition of G. The name of the random graph is due 
to the fact that if for a countably infinite vertex set, one chooses indepen- 
dently and with probability ^ for each pair of vertices whether to connect 
the two vertices by an edge, then with probability 1 the resulting graph is 
isomorphic to the random graph. For the many other remarkable properties 
of G and its automorphism group Aut(G), and various connections to many 
branches of mathematics, see e.g. [13} IMJ. 

Let r be a structure with a finite relational signature r. A first-order 
T-formula is called primitive positive if it is of the form 

3x1, . . . ^1 A • • • A^^, 

where the are atomic^ i.e., of the form yi — y2 or . . . ^yk)i where 

i? G T is a fc-ary relation symbol and the y^ are not necessarily distinct. A 
T-formula is called a sentence if it contains no free variables. 

Definition 2. The constraint satisfaction problem for denoted by CSP(r), 
is the computational problem of deciding for a given primitive positive r- 
sentence $ whether $ is true in F. 

Let = {'01, . . . , ^pn} be a set of graph formulas. Then we define F^ to be 
the structure with the same domain V as the random graph G which has for 
each il^i a relation Ri consisting of those tuples in G that satisfy (where 
the arity of Ri is given by the number of variables that occur impi). Thus by 
definition, F^ is a reduct of G. Now given any instance $ = 01 A - • • A(/)/ with 
variable set W of Graph-SAT(^^), we construct a primitive positive sentence 
in the language of F^ as follows: In we replace every 0^, which by 
definition is of the form tpj{yi, . . . , ym) foi" some I < j < n and variables yk 
from W, by Rj{yi, . . . , ym)i after that, we existentially quantify all variables 
that occur in $^ It then follows immediately from the universality of G that 
the problem Graph-SAT(^^) has a positive answer for $ if and only if the 
sentence holds in F^. Hence, every problem Graph-SAT(^^) is in fact of 
the form CSP(F), for a reduct F of G in a finite signature. We will thus 
henceforth focus on such constraint satisfaction problems in order to prove 
our dichotomy. 

The following lemma has been first stated in [22j for finite domain struc- 
tures F only, but the proof there also works for arbitrary infinite structures. 
It shows us how we can slightly enrich structures without changing the com- 
putational complexity of the constraint satisfaction problem they define too 
much. 

Lemma 3. Let F = (D; i?i, . . . , Ri) be a relational structure, and let R be 
a relation that has a primitive positive definition in F. Then CSP(F) and 
CSP(i?; i?, i?i, . . . , Ri) are polynomial-time equivalent. 

The preceding lemma makes the so-called universal- algebraic approach to 
constraint satisfaction possible, as exposed in the following. We say that 
a fc-ary function (also called operation) f : D preserves an m-ary 

relation R C iff for all ti, . . . ,tk ^ R the tuple f{ti, . . . ,tk) (calculated 
componentwise) is also contained in R. If an operation / does not preserve 
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a relation i?, we say that / violates R. If / preserves all relations of a 
structure F, we say that / is a polymorphism of V (it is also common to 
say that V is closed under /). A unary polymorphism of V is also called an 
endomorphism of F. 

Conversely, for a set F of operations defined on a set D and a relation R 
on we say that R is invariant under F if i? is preserved by all f ^ and 
we write Inv(F) for the set of all finitary relations on D that are invariant 
under F. 

The set of all polymorphisms Pol(F) of a relational structure F forms an 
algebraic object called a clone (see i26j, j^), which is a set of operations 
defined on a set D that is closed under composition and that contains all 
projections. Moreover, Pol(F) is also closed under interpolation (see Propo- 
sition 1.6 in [26j): we say that a fc-ary operation f on. D is interpolated by 
a set of operations F on if for every finite subset A of there is some 
fc-ary operation g ^ F such that g agrees with f on A. We say that F 
locally generates an operation ^ if ^ is in the smallest clone that is closed 
under interpolation and contains all operations in F. Clones with the prop- 
erty that they contain all functions locally generated by their members are 
called locally closed, local or just closed. 

We thus have that to every structure F, we can assign the closed clone 
Pol(F) of its polymorphisms. For certain F, this clone captures the computa- 
tional complexity of CSP(F): a countable structure F is called u- categorical 
if every countable model of the first-order theory of F is isomorphic to F. It 
is well-known that the random graph G is cj-categorical, and that reducts 
of cj-categorical structures are c<j-categorical as well (see for example [21j). 

Theorem 4 (from [7J). Let F be an u- categorical structure. Then the rela- 
tions preserved by the polymorphisms ofT (i.e., the relations in Inv(Pol(F)) ) 
are precisely those having a primitive positive definition in F. 

Clearly, this theorem together with Lemma[3]imply that if two cj-categorical 
structures with finite relational signatures have the same clone of polymor- 
phisms, then their CSPs are polynomial-time equivalent. Recall that we 
have only defined CSP(F) for structures F with a finite relational signa- 
ture. But we now see that it makes sense (and here we follow conventions 
from finite domain constraint satisfaction, see e.g. [12]) to say for arbitrary 
structures F that CSP(F) is (polynomial-time) tractable if the CSP for every 
finite signature structure A with the same polymorphism clone as F is in P, 
and to say that CSP(F) is NP-hard if there is a finite signature structure A 
with the same polymorphism clone as F whose CSP is NP-hard. 

Note that the automorphisms of a structure A are just the bijective unary 
polymorphisms of A which preserve all relations and their complements; the 
set of all automorphisms of A is denoted by Aut(A). It follows from the 
theorem of Ryll-Nardzewski (cf. that for cj-categorical structures A, 

the closed clones containing Aut(A) are precisely the polymorphism clones 
of reducts F of A. Therefore, in order to determine the computational 
complexity of the CSP of all reducts F of G, it suffices to determine for 
every clone clone C containing the automorphism group Aut(G) of G the 
complexity of CSP(F) for some reduct F of G with Pol(F) = C; then the 
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complexity for all reducts with the same polymorphism clone is polynomial 
time equivalent to CSP(r). 

The following proposition is the analog to Theorem |4] on the "operational 
side" , and characterizes the local generating process of functions on a domain 
D by the operators Inv and Pol. 

Proposition 5 (Corollary 1.9 in [26j). Let F he a set of functions on a 
domain D, and let g he a function on D. Then F locally generates g if 
and only if g preserves all relations that are preserved hy all operations in 
F (i.e., if and only if g ^ Pol(Inv(F)) ). 

For some reducts, we will find that their CSP is equivalent to a CSP of 
a structure that has already been studied, by means of the following basic 
observation. 

Proposition 6. Let F, A he homomorphically equivalent^ i.e., they have the 
same signature and there are homomorphisms / : F ^ A and g : A ^ T. 
Then CSP(F) = CSP(A). 

We finish this section with a technical general lemma that we will refer to 
several times in the proof; it allows to restrict the arity of functions violating 
a relation. For a structure F with domain D and a tuple t G D^^ the orhit 
of t in r is the set {a{t) \ a G Aut(F)}. 

Lemma 7 (from [5]). Let T he a relational structure with domain D, and 
suppose that R C intersects not more than m orhits of k-tuples in V. 
Suppose that an operation f on D violates R. Then {/} U Aut(F) locally 
generates an at most m-ary operation that violates R. 

4. Overview of the proof 

Throughout the text, F denotes a reduct of the random graph G = (V] E). 
The proof of Theorem [T] can be organized in three steps as follows. 

The first step is providing hardness proofs for certain relations. More 
precisely, we define three relations P^'^\ T, and H which have first order- 
definitions in G, and show hardness for the CSP defined by each of these 
relations by reducing known NP-hard problems to this CSP. We then know 
from Lemma [3] that if the CSP for a reduct F is not NP-hard, then there is 
no primitive positive definition of any of these relations in F. This implies 
that there are polymorphisms of F which violate the NP-hard relations, by 
Theorem HI 

We then analyze the polymorphisms of F which violate P^^\ T, and H. 
The first, rather basic tool here is Lemma [7| which we use in order to get 
bounds on the arity of such polymorphisms. The deeper part of our analysis 
is the simplification of the polymorphisms by means of Ramsey theory. It 
turns out that the polymorphisms can be assumed to behave regularly in a 
certain sense with respect to the base structure G (the technical term for 
functions showing such regular behavior will be canonical)^ making them 
accessible to case-by-case analysis. 

Finally, the presence of canonical polymorphisms is used in two ways: in 
the case of canonical unary polymorphisms, the image under such a poly- 
morphism sometimes is a structure A for which the CSP has already been 
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classified, and then one can refer to Proposition |6] to argue that the CSP(r) 
is polynomial-time equivalent to the CSP of this structure A. The second, 
and in our case more important way of employing canonical polymorphisms 
is to prove tractability of CSP(r) by transforming the polymorphism into an 
algorithm. Here, we adapt known algorithms showing that certain polymor- 
phisms on a Boolean domain imply tractability of Boolean CSPs in order to 
prove that the same holds for their canonical counterparts on the random 
graph. 

It turned out convenient to split our proof into two cases, the first one 
dealing with the case where either E or the relation N{x^y) defined by the 
formula y) /\ x ^ y has no primitive positive definition in F, and 

the second one dealing with the case where both E and N are primitive 
positive definable in P. In theory, both cases follow the steps described 
above. However, the assumption for the first case implies the presence of 
endomorphisms of P which violate either E or N ^ and in turn allows for 
the use of known results on such endomorphisms, in particular from [28j 
and 0. These older results have been obtained using Ramsey theory, and 
thus by building on them we "outsource" the Ramsey-theoretic analysis of 
polymorphisms of P in this case. In the second case, all endomorphisms 
of P will be trivial in the sense that they preserve E and N (which means 
that they locally look like automorphisms of G), and thus the higher arity 
polymorphisms of P must be considered to a level of detail not present in 
the literature (although we do also draw on earlier results on such higher 
arity polymorphisms from [8j). It is here where we apply Ramsey theory 
directly in our paper. 

Section |6] corresponds to the first case. The second and more involved case 
is dealt with in two sections: Section [7] is purely mathematical and derives 
the presence of certain nice polymorphisms in reducts of G in which the re- 
lations E and N but not a certain 6-ary hard relation H (see Definition 27) 
are primitive positive definable; and Section [8] presents polynomial-time al- 
gorithms for reducts having these polymorphisms. 

The proof of the dichotomy claimed in Theorem [l] is followed by Section [9] 
in which the theorem is stated in more detail and the decidability part of 
the theorem is derived. 



5. Additional conventions 

Since all our polymorphism clones contain the automorphism group Aut(G) 
of the random graph, we will abuse the notion of generates from Section [3| 
and use it as follows: for a set of functions F and a function g on the domain 
y, we say that F generates g when FU Aut(G) locally generates g] also, we 
say that a function / generates g if {/} generates g. That is, in this paper 
we consider the automorphisms of G be present in all sets of functions when 
speaking about the local generating process. 

The binary relation A^(x, y) on V is defined by the formula -i£^(x, y) /\x ^ 
y. We use ^ both in logical formulas to denote the negation of equality, and 
to denote the corresponding binary relation on V. 

When t is an 7T/-tuple, we refer to its entries by ti, . . . , t^^,. 
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6. Endomorphisms 



The goal of this section is the proof of Proposition [T0| which wih ahow 
us to reduce the classification task to the classification of those structures 
where the relations N and 7^ are primitive positive definable. Before we 
can state the proposition we have to define two hard relations called P^^^ 
and T. 

Definition 8. For all /c > 3, let P^^^ denote the fc-ary relation that holds 
on xi, . . . , x/e G y if xi, . . . , x/e are pairwise distinct, and the graph induced 
by {xi, . . . , x/e} in G is neither an independent set nor a clique. 

Definition 9. Let T be the 4-ary relation that holds on xi,X2,X3,X4 G V 
if xi, X2, X3, X4 are pairwise distinct, and induce in G one of the following: 

• a single edge and two isolated vertices; 

• a path with two edges and an isolated vertex; 

• a path with three edges; 

• the complement of one of the structures stated above. 

Proposition 10. Let T be a reduct of G. Then at least one of the following 
holds. 

(a) r has a constant endomorphism, and CSP(r) is trivial. 

(b) r is homomorphically equivalent to a countably infinite structure that 
is preserved by all permutations of its domain; in this case the com- 
plexity 0/ CSP(r) has been classified in [4J^ and is either in P or 
NP-complete. 

(c) There is a primitive positive definition of P^^^ or T in V, and CSP(r) 
is NP-complete. 

(d) The relations N , E, and ^ have primitive positive definitions in V. 

To prove the proposition, we first cite a result about the reducts of the 
random graph due to Thomas [28] (also see [8] for a formulation of this result 
as it is used here). 

The graph G contains all countable graphs as induced subgraphs. In par- 
ticular, it contains an infinite complete subgraph. It is clear that any two 
injective operations from V ^ V whose images induce infinite complete sub- 
graphs in G generate one another; let e^; be one such operation. Similarly, 
G contains an infinite independent set. Let cat be an injective operation 
from V ^ V whose image induces an infinite independent set in G. 

Theorem 11. Let T be a reduct of G. Then one of the following cases 
applies. 

(1) r has a constant endomorphism. 

(2) r has the endomorphism ce- 

(3) r has the endomorphism ejsf. 

(4) The endomorphisms ofV are generated by Aut(r). 

It has been shown in [27] that there exist precisely five closed permutation 
groups that contain Aut(G); by the theorem of Ryll-Nardzewski (confer the 
discussion in Section [s]), these groups correspond precisely to the automor- 
phism groups of reducts of G. Thus, the last case of Theorem [TT] splits into 
five sub-cases, one for each group of the form Aut(r), and knowledge about 
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these groups will be important for our complexity classification. We will 
next cite the theorem that lists them. 

For any finite subset S ofV, if we flip edges and non-edges between S and 
V\S in then the resulting graph is isomorphic to G (it is straightforward 
to verify the extension property). Let is be such an isomorphism for each 
non-empty finite S. Any two such functions generate one another. We also 
write sw for where G is any fixed element of V. 

Let be the fc-ary relation that holds on. xi, . . . ,Xk G V if xi^ . . . ^x^ 
are pairwise distinct, and the number of edges between these k vertices is 
odd. Note that i?^^) is preserved by — , i?^^) is preserved by sw, and that 
is preserved by — and by sw, but not by all permutations of V. Observe 
also that there exist automorphisms q:,/3 of G such that —{a{—{x))) and 
sw{P(sw{x))) are the identity functions on V; hence, if — or sw preserve a 
relation R with a first-order definition in G, they automatically preserve also 
the negation of i?, and hence are automorphisms of the structure {V; R). 

Definition 12. We say that two structures F, A on the same domain are 
first- order interdefinable if all relations of F have a first-order definition in 
A and vice- versa. 

Theorem 13 (from [27j). Let T be a reduct of G. Then exactly one of the 
following is true. 

(1) F is first- order interdefinable with (V'^E); 
equivalently, Aut(F) = Aut(G). 

(2) F is first- order interdefinable with {V'^R^^^); 
equivalently, Aut(F) contains but not by sw. 

(3) F is first- order interdefinable with {V'^R^^^); 
equivalently, Aut(F) contains sw, but not by —. 

(4) F is first- order interdefinable with {V]R^^^); 

equivalently, Aut(F) contains — and sw, but not all permutations of 
V. 

(5) F is first- order interdefinable with {V]—); 
equivalently, Aut(F) contains all permutations ofV. 

In particular, the reducts (V; P^^^) and (V; T) are both first-order interde- 
finable with one of the five structures of this theorem; we will now show with 
which one, and prove hardness for both reducts. We start with ('\/;P(^)). 

Proposition 14. The structure iV'^P^^^) is first- order interdefinable with 
(y;i?(4)). 

Proof. The relation P^^^ is not preserved by sw. ii xi,X2 ^ V are so 
that E{xi,X2) and N{0,Xi) hold for z = 1,2, then (0,xi,X2) G P^^\ but 
{sw{{)), sw{xi), sw{x2)} is a clique in G, so {sw{{)), sw{xi), sw{x2)) ^ P^^\ 



On the other hand, P^^^ is clearly preserved by — . Hence, Theorem 13 
implies that iV'^P^^^) is first-order interdefinable with {V]R^^^). □ 

For /c > 3, let Q^^^ be the fc-ary relation that holds for a tuple (xi, . . . , x^) G 
i^ xi, . . . ,Xk are pairwise distinct, and (xi, . . . ,Xk) ^ P^^\ When prov- 
ing hardness of CSP((y; P^^^)), we will make use of the fact that Q'^^^ is 
primitive positive definable in iV'^P^^^). 
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Lemma 15. The relation Q^^'' is primitive positive definable in ('\/;P(^)). 

Proof. We first show that the relation Q^^^ is primitive positive definable by 
the relation P^^\ Observe first that ^ is primitive positive definable from 
p(^^ by the formula 3u. P^^\x, y, u). Let fi be the primitive positive formula 
that states about the tuple (xi, . . . ^xq) that all its entries are distinct. Let 
(/) be a conjunction of atomic formulas with variables from xi, . . . ,X6 that 
contains for each three-element subset {u^ w} of those variables a conjunct 
P^^\u,v,w). It is known that every two-coloring of the edges of the graph 
Kq (the clique with six vertices) contains a monochromatic triangle. There- 
fore, (/) A fi is unsatisfiable. Let 0' be a conjunction over a maximal subset 
of the conjuncts of with the property that (j)' f\ ji is still satisfiable, and 
suppose without loss of generality that the conjunct P^^\xi^X2^x^) of (j) is 
missing in (f)' . We claim that := 3x/^^x^^XQ.(j)' A ji defines Q^^\ To see 
this, suppose that t is a triple that satisfies '0; that means that there exists 
a 6-tuple 8 extending t which satisfies (j)' Afi. From the maximality of 0', we 
infer that 5, and hence also t, does not satisfy P(^)(xi, X2, X3), so t G Q^^\ 
Therefore, defines a subset of Q^^\ Observe also that does not define 
the empty set since (j)' A /x, and hence also '0, is satisfiable. Let t be any 
tuple satisfying '0. It is easy to check that any if a, h are triples in Q^^\ then 
there is a permutation / generated by — which sends a to h. Since P^^^ is 
preserved by — , this permutation / is an automorphism of P^'^\ so we can 
reformulate this observation by saying that Q^^^ is contained in one orbit 
of (y;P(3)). Hence, for any tuple w in Q^^^ there exists an automorphism 
of iV'^P^^^) that sends t to w. This automorphism clearly preserves '0, and 
hence w satisfies '0, proving the claim. 

Now observe that Q^^\x,y,u) A Q^^\y,u,v) A Q^^\x,y,v) defines Q^^^ 
and hence Q*^^^ also has a primitive positive definition from P^^\ □ 

Proposition 16. CSP((1/; P(^))) is NP- complete. 

Proof. We prove NP-hardness of CSP((y; P(^), which, by the pre- 

ceding lemma and Lemma [sj implies NP-hardness of CSP((y; P^^^)). Our 
hardness proof is by reduction from positive not-all-equal 3SAT, which is 
the variant of 3SAT where all clauses have only positive literals, and the 
task is to find an assignment of Boolean values to the variables such that 
in no clause three variables are set to the same value; this problem is hard 
by Schaefer's theorem. Given an instance $ of positive not-all-equal 3SAT, 
we create an instance of CSP((y; P(^\ Q^^^)) such that has a positive 
answer if and only if $ does, as follows: for each Boolean variable x in 
$ we create two variables Ux and Vx for Moreover, we create variables 
'WC,x,y^'wc,y,z^'^C,z,x foi" every clause C of $ with variables x^y^z. The con- 
juncts of are as follows: for each clause C of $ with variables x, ^, z we 
have in 

• the conjunct P^^n^C,^,^, ^C,y,z, ^C,z,x); 

• the three conjuncts Q^^\ux,Vx,wc^z,x,'^C,x,y), 

Q^^\uy,Vy,Wc^x,y,'^C,y,z), and Q^^Xuz.Vz.Wc^y^z.'^C.z.x)- 

We then existentially quantify all variables in ^ . Now suppose that $ is 
satisfiable, and let 5 be a solution, i.e., a satisfying Boolean assignment to 
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Figure 1. Illustration for the relation T. 



the variables of Then define a graph on the variables of ^ as follows: 
there is an edge between two variables wc^ij and wcj^k iff the value of j 
under 5 is 1. Moreover, there are edges between Ui and all other variables 
iff the value of i under 5 is 1, and we impose the same condition for edges 
with Vi. The resulting graph can be assumed to be a subgraph of G, and it 
is straightforward to check that this graph is a satisfying assignment for 
in (y;P(^),(5(^)). The converse that every solution to in (1/; p(^), Q^^)) 
yields a solution to $ is left to the reader (hint: assign to a variable x of $ 
the value 1 iff there is an edge between Ux and Vx in the solution for ^^). □ 

We now turn to the reduct (V;T). For an illustration of the relation T, 
see Figure [TJ The first row shows all graphs, up to isomorphism, that are 
induced by 4-tuples from the relation T (note that T is totally symmetric, 
i.e., whenever we have a tuple in T and we permute its components, we 
obtain again a tuple in T). The other two rows show the graphs induced 
by 4-tuples of distinct elements of V that are not from T, again up to 
isomorphism. We denote the four vertices of those graphs by 1, 2, 3 and 4, 
as shown in the picture. 

Proposition 17. The structure {V] T) is first- order interdefinable with {V; i?^^)). 
Proof. The relation T is obviously not preserved by all permutations of V, 



but preserved by — . It can be checked that T is preserved by sw. Theorem 13 



implies that it is first-order interdefinable with ("\/;i?(^)). □ 

Lemma 18. Let T be a reduct of G with sw G Aut(r). Then T is contained 
in one orbit of 4-tuples with respect to F. 

Proof. For a graph on {1, 2, 3, 4} and a vertex v G {1, 2, 3, 4}, we say in the 
following that we flip the graph at v when we produce a new graph by flipping 
the edges and non-edges that are adjacent to v. Now consider the graphs of 
the first row of Figure [TJ which are those corresponding to tuples in T. If we 
flip the first one at 2, we obtain the second one; flipping the second graph 
at 3 yields the third one; and flipping the third graph at 4 yields the fourth 
one. Finally, flipping the first graph at 4 yields a graph isomorphic to the 
fifth graph. Since this flipping operation corresponds to applications of sw 
(more precisely: switches with respect to an appropriately chosen point in 
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V) to tuples in T, and isomorphism of these 4- vertex graphs to apphcations 
of appropriately chosen automorphisms of G, we get that indeed any tuple 
in T can be transformed into any other tuple in T by applications of sw and 
automorphisms of G. □ 

Before proving hardness for CSP((y;T)), we show that two auxiliary 
relations are primitive positive definable from T. 

Lemma 19. Let T' he the relation consisting of all ^-tuples with distinct 
entries in V that do not belong to T. Then T' is primitive positive definable 
in iy-T). 

Proof. Observe first that ^ is primitive positive definable from T by the 
formula 3u^v.T(x^y^u^v). Let /i be the primitive positive formula that 
states about the tuple (xi, . . . , xis) that all its entries are distinct. 

It is a well-known fact that every two-coloring of the edges of the graph 
Kis contains a monochromatic clique of size 4. Let be the conjunction over 
all atomic formulas of the form T(a, 6, c, d) where {a, 6, c, d} is a 4-element 
subset of {xi, . . . , xis}- By the above fact, (/) A fi is unsatisfiable. 

Let 00 denote the conjunction over a maximal subset of the conjuncts of 
(/) with the property that 00 A is satisfiable, and let T(yi, . . . , 1/4) denote a 
conjunct from that is not a conjunct of 0o. 

We claim that the relation defined by the formula '0 obtained from 0o A ^ 
by exist entially quantifying all variables except for yi, ... ,1/4 equals T^ To 
see this, suppose that t is a 4-tuple that satisfies ^; that means that there 
exists a 18-tuple s extending t which satisfies 00 A /x. From the maximality 
of 00, we infer that 5, and hence also t, does not satisfy T(yi, . . . ,^4), so 
t ^ T' . Therefore, '0 defines a subset of Observe also that ^ does not 
define the empty set since 0o A /x, and hence also '0, is satisfiable. Let t 



be any tuple satisfying 0^. By Proposition [17, T is preserved by — and sw] 
one can use this fact to show, as in Lemma 18, that T' is contained in one 
orbit of 4-tuples with respect to {V]T). Hence, for any tuple w in T' there 
exists an automorphism of T that sends t to w. This automorphism clearly 
preserves 0, and hence w satisfies 0, proving the claim. □ 

In our hardness proof for CSP((y;T)), the following ('link'-) relation 
L C is of central importance. For a finite subset S of V ^ write #5 for 
the parity of edges between members of S. Now define 

L := |x G I the entries of x are pairwise distinct, and 

#{xi,X2,X3} = #{X4,X5,X6}} . 

Lemma 20. The relation L is primitive positive definable in (V;T). 

Proof. By the preceding lemma, it is enough to show that L is primitive 
positive definable in (V;T'). Let (j)(x^y^z^u^v^w) be the conjunction over 
all atomic formulas of the form T\a^h^c^d) for every four-element subset 
{a, 6, c, d} of {x, ^, z, V, w}. We claim that 

:= 3^1, 1/2, 1/3.0(^1, ^2, xs, 2/1,2/2, ys) A 0(2/1, 2/2, 2/3, ^4, ^5, xq) 
defines L(xi, . . . , xq). 
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Observe first that L is preserved by sw and — . Moreover, since T' is 
preserved by sw and — , so is '0. We can therefore take the hberty of applying 
sw and — to tuples when showing the equivalence of '0 and L. 

Let t G be a tuple with pairwise distinct entries. We first show that 
when t G L then it also satisfies '0. Applying sw and — to t repeatedly, we 
can assume that N{ti,t2) A N{t2M) A N{ti,U) and N{U,t^) A Nit^.to) A 
N(t4^tQ). Choose values 51,52,53 G V for the variables ^1,^2, 1/3 such that 
{ti, . . . , te, 5i, 52, 53} induces an independent set in G. This satisfies all 
conjuncts in tp. 

For the converse, we suppose that t satisfies '0. Let 51,52,53 G be 
witnesses for ^1,^2,1/3 that show that '0 holds for t. Assume for contra- 
diction that t ^ L, i.e., #{^1,^2,^3} 7^ #{^4,^5,^6}- Applying sw and — , 
we can assume that each of the three disjoint sets {^1,^2,^3}, {^4,^5,^6}, 
and {51,52,53} induces either a clique or an independent set in G. As- 
sume without loss of generality that {51,52,53} induces a clique. Since 
# {^1 , ^2 , ^3 } ^ # {^4 , ^5 , ^6 } , precisely one of the sets {^1,^2,^3} and {^4 ,^5,^5} 
induces an independent set, and we may assume without loss of generality 
that this is the case for {^1,^2, ^3}- Because of T'(ti, ^2, ^3,51), either the re- 
lation E holds between all pairs {ti, 5i}, {^2,51}, and {^3, 5i}, or the relation 
N holds between all these pairs. Similarly, we can observe the same for 52 
and 53, so one of the two possibilities occurs twice. So assume without loss 
of generality that E{p,q) holds for any pair {p,q) with p G {^1,^2,^3} and 
q G {51, 52}. But then T'(ti, ^2, 5i, 52) does not hold, a contradiction. □ 

Proposition 21. CSP((y;r)) is NP-complete. 

Proof. Our hardness proof is by reduction from an NP-hard Boolean con- 
straint satisfaction problem to CSP((y; T, L)); NP-hardness of CSP((y; T)) 
then follows by the preceding lemma and Lemma |3j Consider 

CSP(({0, 1}; {t G {0, 1}^ I ti + t2 + t3 + t4 = 2})) , 

which is NP-hard by Schaefer's theorem [25]. From a given instance $ 
of this Boolean CSP with variables xi, . . . we create an instance of 
CSP((y;r, L)) as follows. We first describe the variables of There are 
three variables yl,y^,y^ for each a G {xi,...,x^}. Moreover, has four 
variables z^^ ^, z^^ ^, z^^ ^, z^^ ^ for each constraint C of the form a + 6 + c + 
d = 2 in $. The constraints of are as follows. For each constraint C of 
the form a + 6 + c + (i=2in$, we add the constraints 

"^{^a.b.c^ ^a,b4^ ^a.c.d^ ^b.c.d) ^-^i^a.b.c^ ^a.b.d^ ^a.c.d^ Va^ Va^ Va) 

^^(^a,b,c^ ^a,b,d^ ^b.c.d^ vhyb^Vb) 
^^i^a,b,c^ ^a,c4^ ^b,c4^ Vc^Vc^ Vc) 
^^i^a.b.d^ ^a,c4^ ^b.c.d^ Vd^ Vd^ Vd) ' 

Clearly, ^ can be computed in polynomial time from We first verify that 
if $ has a solution 5 : {xi, . . . , x^} {0, 1}, then is satisfiable as well. It 
clearly suffices to specify a graph on the variables of which satisfies the 
constraints of ^. For a G {xi, . . . ,x^}, we let {y^^y'i^y^} form a clique if 
5(a) = 1, and an independent set otherwise. Since 5 satisfies all constraints 
a + 6 + c + (i = 2of$, 5 maps exactly two of the variables a, 6, c, to 1; 
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for the sake of notation, suppose that s(a) = 5(6) = 1 (the other cases are 
handled analogously). Then the graph on the variables of has an edge 
between z^^^ and z^^^. All other pairs of variables of are not adjacent. 
It is straightforward to verify that indeed all constraints of are satisfied. 

The central observation for proving the converse is that for any tuple 
t CV^ that satisfies r(ti, ^2, ^3, ^4), there are exactly two 3-element subsets 
of {!,... ,4} such that R^^\ti,tj,tk) holds. Suppose that there is 
a solution to We define a mapping s : {xi, . . . , x^} {0, 1} as follows. 
For a G {xi, . . . , x^}, if in the solution to we have that R^^\yl, y^, y^) is 
true, then we set s{a) = 1, and otherwise we set s{a) = 0. It follows from 
the observation we just made that s satisfies all constraints of □ 



We are now ready to prove Proposition [TOj showing that we can in the 
rest of our classification project focus on those languages that contain the 
relations N, and ^. 



Proof of Proposition \Tu\ Suppose E does not have a primitive positive defi- 
nition in r. We have that E consists of just one orbit of pairs in G, and thus, 
since Aut(r) ^ Aut(G), also in T. Hence, Lemma [t] shows the existence of 
an endomorphism e of F that violates E. 

Theorem [11] states that either all endomorphisms of F are generated by its 
automorphisms, or F has a constant endomorphism, or the endomorphism 
e^, or the endomorphism e^. If F has a constant endomorphism we are in 
Case (a) and done. If F has the endomorphisms ce or e^, then we are in 
Case (b) since e^[y] and eAr[l^] induce structures in G which are invariant 
under all permutations of their domain. So assume in the following that F 
has neither e^, nor cat, nor a constant as an endomorphism, and that all 
endomorphisms of F are generated by Aut(F). 

In particular, e is generated by Aut(F). This implies that Aut(F) does 
not equal Aut(G), and hence it contains either — or sw^ by Theorem [13) 

Suppose that Aut(F) contains — . If the relation P^^^ is primitive positive 
definable in F, then CSP(F) is NP-complete by Proposition 16 and Lemmajs) 



and we are in Case (c). Otherwise, since the relation P^^Mn contained in 
one orbit of triples in F and by Lemma [7| there exists an endomorphism ei 
of F that violates P^^\ Since ei is generated by Aut(F), there is also an 
automorphism of F that violates P^^^ , and hence also R^^"^ by Proposition 
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Thus, Theorem 13 shows that Aut(F) contains sw. 

We can thus henceforth assume that Aut(F) contains sw. Then the rela- 
tion T is contained in one orbit of 4-tuples with respect to F, by Lemma [TS} 



If the relation T is primitive positive definable, then we are in Case (c) and 



CSP(F) is NP-hard by Lemma |3] and Proposition 21 Otherwise, since T 
in contained in one orbit of F, Lemma [7| implies that there is an endomor- 
phism 62 of F that violates T. Since 62 is generated by Aut(F), there is 
an automorphism of F violating T, and thus by Proposition IT7| there is an 



automorphism of F violating R^^\ It then follows from Theorem 13 that 
Aut(F) contains all permutations, bringing us back into Case (b). 

The reasoning for the case when N has no primitive positive definition 
in F is dual. So we may assume that both E and N are primitive positive 
definable. Then so is 7^ since x 7^ y iff 3 z.E{x, z) A N(y, z). □ 
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7. Higher arity polymorphisms 

In the following we will be concerned with reducts T = (l/;£^,iV, /,...) 
of G that contain the relations N and 7^. While in the last section, 
we only dealt with endomorphisms and automorphisms of reducts of G, the 
remaining cases will require the study of higher arity polymorphisms of such 
reducts. It turns out that the relevant polymorphisms proving tractability 
have, in a certain sense, regular behavior with respect to the structure of G; 
combinatorially, this is due to the fact that the set of finite ordered graphs 
is a Ramsey class^ and that one can find regular patterns in any arbitrary 
function on the random graph. We make this idea more precise. 

Definition 22. Let A be a structure. The type tp(a) of an n-tuple a of 
elements in A is the set of first-order formulas with free variables xi^ . . . 
that hold for a in A. For structures Ai, . . . , /S.j^ and tuples a^, . . . , G Ai x 

• • • X Afc, the type of (a^, . . . , a^) in Ai x • • • x A/^, denoted by tp(a^, . . . , a^), 
is the fc-tuple containing the types of (a]^, . . . , a^) in A^ for each 1 <i <k. 

We bring to the reader's attention the well-known fact that in homoge- 
neous structures in a finite language, in particular in the random graph, two 
n-tuples have the same type if and only if their orbits coincide. 

Definition 23. Let A: > 1 and let Ai, . . . , A/^^, A be structures. A type 
condition between Ai x • • • x and A is a pair (t, 5), where t is a type of 
an n- tuple in Ai x • • • x A^, and 5 is a type of an n-tuple in A, for some 
n > 1. A function / : Ai x • • • x A/^ ^ A satisfies a type condition (t^s) 
between Ai x • • • x Aj^^ and A if for all tuples a^, . . . , a"' G Ai x • • • x Aj^^ with 
tp(a^, . . . , a^) = t the n-tuple (/(aj, . . . , a^), . . . , /(a^^, . . . , a^)) has type s 
in A. A behavior is a set of type conditions between a product of structures 
Ai X • • • X and a structure A. A function from Ai x • • • x A/^ to A has 
behavior B if it satisfies all the type conditions of B. 

Definition 24. Let Ai, . . . , Aj^^^, A be structures. An operation / : Ai x 

• • • X Afc ^ A is canonical if for all types t of n-tuples in Ai x • • • x A^ there 
exists a type s of an n-tuple in A such that / satisfies the type condition 
(t, s). In other words, n-tuples of equal type in Ai x • • • x A/^; are sent to 
n-tuples of equal type in A under /. 

We remark that since G is homogeneous and has only binary relations, 
the type of an n-tuple a in G is determined by its binary subtypes, i.e., the 
types of the pairs (a^,aj), where 1 < i, j < n. In other words, the type of 
a is determined by which of its components are equal, and between which 
of its components there is an edge. Therefore, a function f : G^ ^ G is 
canonical iff it satisfies the condition of the definition for types of 2-tuples. 
The polymorphisms proving tractability of reducts of G will be canonical. 
We now define different behaviors that some of these canonical functions will 
have. For Qi, . . . , Q/c G {E^ =, 7^}, we will in the following write Qi' "Qk 
for the binary relation on that holds between two fc-tuples x^y E iS 
Qii^i^yi) holds for all 1 < z < A:. 

We start by behaviors of binary functions. 

Definition 25. We say that a binary injective operation f iV'^ ^ V is 
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• balanced in the first argument if for all u^v G V'^ we have that 
E={u,v) imphes E{f{u),f{v)) and N={u,v) imphes N{f{u),f{v))] 

• balanced in the second argument if (x^y) ^ f{y^x) is balanced in 
the first argument; 

• balanced if / is balanced in both arguments, and unbalanced other- 
wise; 

• E-dominated (N -dominated) in the first argument if for all u^v ^V'^ 
with we have that E{f{u)J{v)) {N{f{u)J{v))); 

• E-dominated (N -dominated) in the second argument if {x^y) 
f{y^x) is E'-dominated (A^-dominated) in the first argument; 

• E-dominated (N -dominated) if it is £^-dominated (iV-dominated) in 
both arguments; 

• of type min if for all u^v eV'^ with ^^{u^ v) we have E(f{u)^ f{^)) 
if and only if EE(ix, v); 

• of type max if for all u^v eV'^ with 7^7^(1^, v) we have N(f(u)^ f{^)) 
if and only if NN(n, v)] 

• of type pi if for all u^v eV'^ with ^^{u^ v) we have E(f(u)^ f{^)) if 
and only if E{ui,vi); 

• of type p2 if (x, y) ^ f{y, x) is of type pi] 

• of type projection if it is of type pi or p2. 

Note that, for example, being of type max is a behavior of binary functions 
that does not force a function to be canonical, since the condition only talks 
about certain types of pairs in G^, but not all such types; however, being of 
type max and balanced does mean that a function is canonical. The next 
definition contains some important behaviors of ternary functions. 

Definition 26. An injective ternary function / : y is of type 

• majority if for all u^v eV^ with ^^^{u^ v) we have that E{f{u)^ f{^)) 
if and only if EEE(ix, v), EEN(ix, v), ENE(ix, v), or NEE(ix, v); 

• minority if for all u,v eV^ with ^^^(u,v) we have E(f(u),f{v)) 
if and only if EEE{u,v), NNE{u,v), NEN{u,v), or ENN(i/,i;). 

While the tractability results for reducts T = (V; E", iV, 7^, . . .) of G will 
be shown by means of a number of different canonical functions, all hardness 
cases for such reducts will be established by the following single relation. 

Definition 27. We define a 6-ary relation H{xi^yi^X2^y2^xs^ys) on V by 

hj^{'^,'^,^},i^j,ue{xi,yi},ve{xj,yj} 

A {{{E{xi,yi) A N{x2, ^2) A N{xs, ys)) 

V {N{xi,yi)AE{x2,y2)/\N{xs,ys)) 

V {N{xi,yi) AN{x2,y2) AEixs.ys))) ^ 

Our goal for this section is to prove the following proposition, which states 
that if r = {V; A^, 7^, . . . ) is a reduct of G, then either H has a primitive 
positive definition in T, and CSP(r) is NP-complete, or T has a canonical 
polymorphism with a certain behavior. Each of the listed canonical poly- 
morphisms implies tractability for CSP(r), and we will present algorithms 
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proving this in Section |8j Together with Proposition [TOj we then have a 
complete proof of the dichotomy statement of Theorem [l] 

Proposition 28. Let V = {V] E^N^^^ . . .) he a reduct of G. Then at least 
one of the following holds: 

(a) There is a primitive positive definition of H in and CSP(r) is 
NP-complete. 

(b) Pol(r) contains a canonical ternary injection of type minority, as 
well as a canonical binary injection which is of type pi and either 
E-dominated or N -dominated in the second argument. 

(c) Pol(r) contains a canonical ternary injection of type majority, as 
well as a canonical binary injection which is of type pi and either 
E-dominated or N -dominated in the second argument. 

(d) Pol(r) contains a canonical ternary injection of type minority, as 
well as a canonical binary injection which is balanced and of type 
projection. 

(e) Pol(r) contains a canonical ternary injection of type majority, as 
well as a canonical binary injection which is balanced and of type 
projection. 

(f) Pol(r) contains a canonical binary injection of type max or min. 



The remainder of this section contains the proof of Proposition 28 , and is 



organized as fohows: In Subsection 7.1, we show that the relation H is hard. 



We then prove in Subsection 7.2 that if H does not have a primitive positive 



definition in a reduct F as in Proposition 28 , then F has the polymorphisms 



of one of the Cases (b) to (f) of the proposition. 

7.1. Hardness of H. This subsection contains the hardness proof of the 



relation in Case (a) of Proposition 28 



Proposition 29. CSP((y;i7)) is NP-hard. 

Proof. The proof is reduction from positive l-in-3-3SAT (one of the hard 
problems in Schaefer's classification; also see [19j). Let $ be an instance 
of positive l-in-3-3SAT, that is, a set of clauses, each having three positive 
literals. We create from $ an instance ^ of CSP((y; H)) as follows. For each 
variable x in $ we have a pair Vx of variables in When {x, y, z} is a 
clause in then we add the conjunct H{ux, Vx, Uy, Vy, Uz, Vz) to Finally, 
we existentially quantify all variables of the conjunction in order to obtain 
a sentence. Clearly, can be computed from $ in linear time. 

Suppose now that $ is satisfiable, i.e., there exists a mapping s from the 
variables of $ to {0, 1} such that in each clause exactly one of the literals is 
set to 1; we claim that (V; H) satisfies To show this, let F be the graph 
whose vertices are the variables of and which has an edge between Ux 
and Vx if x is set to 1 under the mapping s, and which has no other edges. 
By universality of G we may assume that F is a subgraph of G. It is then 
enough to show that F satisfies the conjunction of in order to show that 
{V;H) satisfies "if. Indeed, let H{ux^Vx^Uy^Vy^Uz^Vz) be a clause from ^f. 
By definition of F, the conjunction in the first line of the definition of H is 
clearly satisfied; moreover, from the disjunction in the remaining lines of the 
definition of H exactly one disjunct will be true, since in the corresponding 
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clause {x, z} of $ exactly one of the values s(y)^ s{z) equals 1. This 
argument can easily be inverted to see that every solution to "if can be used 
to define a solution to $ (in which for a variable x of $ one sets s{x) to 1 
iff in the solution to "if there is an edge between Ux and Vx). □ 

7.2. Producing canonical functions. We now show that if F = {V; E, ^ 
, . . .) is a reduct of G such that there is no primitive positive definition of H 
in r, then one of the other cases of Proposition |28] applies. By Theorem |4| 
r has a polymorphism that violates H. 

Definition 30. A function / : V is called essentially unary if it 

depends on only one of its variables; otherwise, it is called essential. 

Since any polymorphism of F preserves E and A^, and since any essentially 
unary function preserving both E and preserves all relations with a first- 
order definition in G (and in particular H), we have that if a polymorphism 
/ of r violates H, then it must be essential. Thus the following theorem 
from [8J applies. Before stating it, it is convenient to define the dual of an 
operation / on G, which can be imagined as the function obtained from / 
by exchanging the roles of E and A^. 

Definition 31. The dual of a function /(xi, . . . , x^) on G is the function 

~ f • • • 1 ~^n)- 

Theorem 32 (from [8J). Let f be an essential operation on G preserving E 
and N. Then it generates one of the following binary functions. 

• a canonical injection of type pi which is balanced; 

• a canonical injection of type max which is balanced; 

• a canonical injection of type pi which is E-dominated; 

• a canonical injection of type max which is E-dominated; 

• a canonical injection of type pi which is balanced in the first and 
E-dominated in the second argument; 

or one of the duals of the last four operations (the first operation is self-dual). 

It follows that indeed, if H does not have a primitive positive definition 
in a reduct F = (V; E, A^, 7^, . . .), then F has one of the binary canonical 



polymorphisms mentioned in Proposition 28 To complete the proof of the 



proposition, it is therefore enough to prove the following. 

Proposition 33. Suppose that f is an operation on G that preserves the 
relations E and N and violates the relation H. Then f generates a binary 
canonical injection of type min or max^ or a ternary canonical injection of 
type minority or majority. 

The remainder of this section will be devoted to the proof of this propo- 
sition. This will be achieved by refining the Ramsey-theoretic methods de- 
veloped in [8] which are suitable for investigating functions on G in several 



variables. In our proof of Proposition 33, we really would like to replace / 



by one of the "nice" functions of Theorem 32 However, the problem with 
this are the canonical binary injections of type pi, since functions of type pi 
do not violate H anymore. Hence, when simply passing to a function of the 
theorem, we lose the information that our / violates which we must use 
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at some point, since i7 is a hard relation. We are thus obhged to improve 
the method from [8J in order to keep this piece of information whilst passing 
to canonical functions. Before that, let us observe that Theorem [32] implies 
that we can restrict our attention to binary and ternary injections. 

Lemma 34. Let f be an operation on G which preserves E and N and 
violates H. Then f generates a binary or ternary injection which shares the 
same properties. 

Proof. Since the relation H consists of three orbits of 6-tuples with respect 
to G, Lemma [7| implies that / generates an at most ternary function that 
violates i7, and hence we can assume that / itself is at most ternary; by 
adding a dummy variable if necessary, we may assume that / is actually 
ternary. Moreover, / must certainly be essential, since essentially unary 



operations that preserve E and N also preserve H. Applying Theorem [32 
we get that / generates a binary canonical injection g of type min, max, or 
pi. In the first two cases we are done, since binary injections of type min 
and max violate H] so consider the last case where g is of type pi. Now 
consider 

h{x,y,z) g{g{g{f{x,y,z),x),y),z) . 
Then h is clearly injective, and still violates H - the latter can easily be 
verified combining the facts that / violates H, g is of type pi, and all tuples 
in H have pairwise distinct entries. □ 

It will turn out that just as in the proof of Lemma [34) there are two cases 



for / in the proof of Proposition |33j Either all binary canonical injections 
generated by / are of type projection, and / generates a ternary canonical 
injection of type majority or minority, or / generates a binary canonical 
injection of type min or max. We start by considering the first case, which 
is combinatorially less involved. 

Definition 35. Let Ai , . . . , A/^ and A be structures, / : Ai x • • • x A/^ A be 

a function, and let (t, s) be a type condition for such functions. If 5 is a sub- 
set of Ai x ••• x A/e, then we say that / satisfies the type condition (t, s) on S 
if for all tuples a-^, . . . , E 5 with tp(a-'^, . . . , a^) = t in Ai x • • • x A^^^ (not in 
the substructure induced by S) the n-tuple (/(aj, . . . , a^), . . . , /(aj, . . . , a^)) 
has type s in A. We say that / satisfies a behavior B on S if it satisfies all 
type conditions of B on S. 

Finally, we say that / satisfies B on arbitrarily large (finite) substructures 
of Ai X • • • X A/e if for all finite substructures Fi of A^, where 1 < i < A:, there 
exist copies F- in A^ such that / satisfies B on the product F[ x • • • x Fj^ of 
these copies. 

In the following general proposition we exceptionally use the notion "lo- 
cally generates" in its original sense (see Section [s]). The proof is a standard 
compactness argument, which we include nonetheless for the convenience of 
the reader. Similar proofs can be found, for example, in JTT\ for arbitrary 
homogeneous structures in a finite language, or for the random graph in [8J. 

Proposition 36. Let Ai, . . . , A/^ and A be homogeneous structures on the 
same countably infinite domain D, and assume that A has a finite language. 
Let moreover B be a behavior for functions from Ai x • • • x Aj^ to A, and let 
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/ : D be a function which satisfies B on arbitrarily large substructures 

o/ Ai X • • • X Afc. Then {/} U Aut(A) U Aut(Ai) U • • • U Aut(A/,) locally 
generates a function from to D which satisfies B everywhere. 

Proof. Write D = {do^di^ . . .}. We construct a sequence {gi)ieuj such that 
for all z G 

(i) gi is a function from to D locally generated by {/} U Aut(A) U 
Aut(Ai)U---UAut(A;j,); 

(ii) gi behaves like B on {(ig, • • . , 

(iii) ^i+i agrees with on {do, ... , diY . 

The sequence then defines a function g : ^ Dhj setting g{di^ , . . . , d^^) := 
9m{di-^^ . . . ^di^)^ foi" 9.ny m > ii, . . . This function g is clearly locally 
generated by {gi : i G u} by local closure, and behaves like B everywhere. 

To construct the sequence, we first construct a sequence {hi)i^^ which only 
satisfies (i) and (ii) of the requirements for the sequence [gi)i^^^. Let z G be 
given. There exist subsets Fi, . . . , F/^ of D such that Fj is isomorphic with 
{(io, . . . , (ii} as substructures of Aj for all 1 < j < A; and such that / behaves 
like B on FiX - • - xFj^. Let aj be an automorphism of Aj sending {do, . . . , di} 
onto Fj, for all 1 < j < /c; these automorphisms exist by the homogeneity 
of the Aj. Then we can set hi{xi, . . . , Xk) '•= f{ai{xi), . . . , ak{xk)). 

Now to obtain the sequence {gi)ieuj from the sequence {hi)i^(_^, let a = 
(ao, ai, . . .) be an enumeration of such that the elements of {do, . • . , di}^ 
are an initial segment of this enumeration for each i e oo (that is, they 
constitute the first (i + 1)^ entries). Denote for alH, j G cj by b^^^ the (i + 1)^- 
tuple which is obtained by applying hj to each of the first (z + 1)^ entries of 
the enumeration a. Set f'^ to be the type of 6^'-^ in A. For i^j^r^s G uj set 
fd < fr,s if 2 < r and t*'-^, t^'* agree on the variables they have in common, 
i.e., the restriction of 6^'^ to its initial segment of length (i + 1)^ has the 
same type as 6^'-^ in A. This relation defines a tree on the types t^'-^ . Since A 
is homogeneous in a finite language, for every i G there are only finitely 
many different types of (i + l)^-tuples in A. Hence, for every i G cj, there 
are only finitely many distinct types t^'-^, and so there exists a g G cj such 
that t^'^ = t^'^ for infinitely many 5 G cj. Deleting all elements of the tree 
which do not enjoy this latter property, we are thus still left with an infinite 
tree which by the above is finitely branching. Hence by Konig's lemma it 
has an infinite branch (t^'-^o, t^'-^^, . . .). Since we have reduced the tree to 
its "infinite" nodes, we may assume that the ji are strictly increasing, and 
in particular that ji > i for all i G a;. Since A is homogeneous and by 
definition of the tree, we can pick for all z G an automorphism ai of A 
which sends the initial segment of length (z + 1)^ of to 6^'-^*. Then 

setting gi := hj. o ai-i o . . . o o^o for all z G cj yields the desired sequence: 
(i) is obvious, (ii) holds since hi satisfies (ii), hj. still satisfies (ii) since 
ji > i^ and gi satisfies (ii) since the property is preserved under applications 
of automorphisms of A. (iii) is by construction. □ 

Proposition 37. Let f be an operation on G which preserves E and N and 
violates H . Suppose moreover that all binary injections generated by f are 
of type projection. Then f generates a ternary canonical injection of type 
majority or minority. 
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Proof. By Lemma 34, we can assume that / is a binary or ternary injection; 
if it was binary, it would be of type projection and thus preserve i7, so it 
must be ternary. Because / violates H, there are x , x , x e H such that 
(/(xi), . . . , /(xe)) ^ H, where Xi := {xj.xf, x^) for 1 < z < 6. 

If there was an automorphism o: of G such that a(x^) = x^ for 1 < z ^ 
j < 3, then / would generate a binary injection that still violates i7, which 
contradicts the assumption that all binary injections generated by / are of 
type projection. By permuting arguments of / if necessary, we can therefore 
assume without loss of generality that 

ENN(xi,X2), NEN(x3,X4), and NNE(x5,X6). 

We set 

S :={yeV^ \ NNN(xi, y) for ah 1 < i < 6} . 

Consider the binary relations Q1Q2QS on V^^ where Qi G {E^N} for 1 < 
i < 3; each of these relations defines a 2-type in G^. We claim that for every 
2-type t defined by one of those relations there is a 2-type 5 in G such that 
/ satisfies the type condition (t, s) on S. To prove the claim, fix a relation 
Q1Q2QS and let u^v ^ S he such that QiQ2Qs(u^v) holds; we must show 
that whether E{f{u)^ f{^)) oi" ^ifi^)^ /('^)) depends only on Q1Q2Q3 (and 
not on u^v). We go through all possibilities of Q1Q2Q3' 

(1) Q1Q2Q3 = ENN. Let a G Aut(G) be such that (x^, ^2) 
is mapped to {xf, X2,us,vs); such an automorphism exists since 
NNN(xi, u), NNN(xi, i;), NNN(x2, i/), NNN(x2, v), and since (x^, xj) 
has the same type as (xf,X2), and {u2,V2) has the same type as 
('^3, '^3)- By assumption, the operation g defined by g{x^y) := 
/(x, a{y)) must be of type projection. Hence, E(g(ui^ '^2), ^('^1, '^2)) 

iff £^(^(xj, x^), ^(x2, X2)). Combining this with the equations {f{u), f{v)) 
{g{ui,U2),g{vi,V2)) and (^(xj, x?), ^(x^, x|)) = (/(xi), /(X2)), we 
get that E{f{u), f{^)) iff E{f{xi), f{x2))^ and so we are done. 

(2) Q1Q2Q3 = NEN or Q1Q2Q3 = NNE. These cases are analogous to 
the previous case. 

(3) Q1Q2Q3 = NEE. Let a be defined as in the first case. By as- 
sumption, the operation defined by f(x^y^a(y)) must be of type 
projection. Reasoning as above, one gets that E{f{u)^f{v)) iff 
A^(/(xi),/(x2)). 

(4) Q1Q2Q3 = ENE or Q1Q2Q3 = EEN. These cases are analogous to 
the previous case. 

(5) Q1Q2Q3 = EEE or Q1Q2Q3 = NNN. These cases are trivial since / 
preserves E and N. 

To show that / generates an operation of type majority or minority, by 
Proposition |36] it suffices to prove that / generates a function that satisfies 
the behavior defining type majority or minority on 5, since S contains copies 
of products of arbitrary finite substructures of G. We show this by another 
case distinction, based on the fact that (/(xi), . . . , /(xe)) ^ H. 

(1) Suppose that £;(/(xi), /(X2)), £;(/(x3), /(X4)), E(f(x,), /(xe)). Then 
by the above, / itself satisfies the minority behavior on S. 

(2) Suppose that N{f{xi), /(xa)), iV(/(x3), /(X4)), A^(/(x5), /(xe)). Then 
/ behaves like a majority on S. 
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(3) Suppose that E(f(xi)J(x2))^E{f{xs)J{x^)),N(f(x,)J(xe)). Let 
e be a self- embedding of G such that for all w ^V^ sill < j < 3, and 
all 1 < z < 6 we have that N{xj, e{w)). Then the ternary operation 
defined by /(x, e(/(x, z))) is of type majority on S. 

(4) Suppose that E{f{xi)J{x2)),N{f{x3)J{x^)),E{f{x^)J{xe)), or 
7V(/(xi),/(x2)), E{f{x3)J{x4)), E{f{x^)J{xG)). These cases are 
analogous to the previous case. 

Let z) be a ternary injection of type majority or minority generated 

by /; it remains to make h canonical. By Theorem [32| / generates a binary 
canonical injection g{x,y), which is of type projection by our assumption 
on /. Assume wlog that it is of type pi and set t{x,y,z) := g{x, g{y, z)). 
Then the function h{t{x^y^ z)^t{y, z,x),t{z,x,y)) is still of type majority 
or minority and canonical; we leave the straightforward verification to the 
reader. □ 

Having proven Proposition [371 it is enough to show the following propo- 



sition in order to obtain a full proof of Proposition 33 



Proposition 38. Let f : ^ V be a binary injection preserving E and N 
that is not of type projection. Then f generates a binary canonical injection 
of type min or of type max. 

In the remainder of this section we will prove this proposition by Ramsey 
theoretic analysis of /, which requires the following definitions and facts 
from [8J. 

Equip V with a total order ^ in such a way that {V;E, ^) is the ran- 
dom ordered graph, i.e., the unique countably infinite homogeneous totally 
ordered graph containing all finite totally ordered graphs (for existence and 
uniqueness of this structure, see e.g. [21]). The order (V;^) is then iso- 
morphic to the order (Q; <) of the rationals. The ordered random graph 
has the advantage of being a so-called Ramsey structure, i.e., it enjoys a 
certain combinatorial property (which the random graph without the order 
does not) - see for example [9]. Using this Ramsey property, starting from a 
function on {V; ^) one can generate a canonical function whilst keeping 
such information as violation of a relation. Our combinatorial tool will be 
the following proposition, which has first been used in [8J in a slightly simpler 
form, and which has been stated in full generality for ordered homogeneous 
Ramsey structures in [llj. 

Proposition 39. Let f : ^ V be a function, and let c^, . . . G V^. 

Then f generates a function which is canonical as a function from (V; E, ^ 
, c];, . . . , c^) X • • • X (V; E, ^, c^, . . . , c^) to {V; E, and which is identical 
with f on {cj, . . . , c^"} x • • • x {4, . . . , c^}. 

The global strategy behind what follows now is to take a binary injection 
/ and fix a finite number of constants G V'^ which witness that / is not of 



type projection. Then, using Proposition 39, we generate a binary canonical 
function which is identical with / on all c^; this canonical function then still 
is not of type projection, and can be handled more easily as it is canonical. 
However, we do not present the proof like that for the reason that there 
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would be too many possibilities of canonical functions for primitive case- 
by-case analysis. What we do instead is rule out behaviors of canonical 
functions more systematically, for example before even adding constants to 
the language. As in [8J, let us define the following behaviors for functions 
from (!/;£;, to {V;E). 

Definition 40. Let f : V'^ ^ V he injective. If for ah u,v e V'^ with 

ui vi and U2 ^ V2 we have 

• E(f(u)^ f{^)) if ^iid only if EE(i/,, v), then we say that / behaves like 
min on input ^). 

• N{f{u),f{v)) if and only if NN(i/,i;), then we say that / behaves 
like max on input ^). 

• E{f{u)^f{v)) if and only if E{ui,vi), then we say that / behaves 
like pi on input ^). 

• E{f{u)^f{v)) if and only if E(u2,V2), then we say that / behaves 
like p2 on input ^). 

Analogously, we define behavior on input (^, >-) using pairs u,v ^ with 
ui ^ vi and U2 y V2- 

Of course, we could also have defined "behavior on input (^, and 
"behavior on input (^,^)"; however, behavior on input (^,>^) equals be- 
havior on input (^, ^), and behavior on input (>^,^) equals behavior on 
input (^,>-). Thus, there are only two kinds of inputs to be considered, 
namely the "straight input" ^) and the "twisted input" >^). 

Proposition 41. Let f : ^ V be injective and canonical as a function 
from (V;E,^)'^ to (V;E,^), and suppose it preserves E and N. Then it 
behaves like min^ max^ pi or p2 on input ^) (and similarly on input 

Proof By definition of the term canonical; one only needs to enumerate all 
possible types of pairs {u^v) ^V'^. □ 

Definition 42. If an injection f : ^ V behaves like X on input (^, ^) 
and like Y on input (^,>-), where X, y G {max, min, pi, 7)2} , then we say 
that / is of type X/Y. 

We would like to emphasize that the term "canonical" depends on the 
structures under consideration; that is, a function f : V'^ ^ V might be 
canonical as a function from (V; E, to (V; E, ^), but not as a function 
from (V]E)'^ to (V;E), and vice-versa. In the following, we will for this 
reason carefully specify the structures we have in mind when using this 
term. 

Observe that canonical functions from (V; E, to (V; E, ^) also behave 
regularly with respect to the order ^: this implies, for example, that / is 
either strictly increasing or decreasing with respect to the pointwise order. 
Fix from now on any automorphism a of the graph G that reverses the 
order on V. By applying a to a canonical function if necessary, we may (in 
the presence of Aut(G)) always assume that all canonical functions / we 
use are strictly increasing. Having that, one easily checks that one of the 
implications 

Ui -< Vi AU2 ^ V2 ^ f{u) ^ f{v) 
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and 

Ui ^ Vi AU2 -< V2 ^ f{u) ^ f{v). 

hold. In the first case, we say that / obeys pi for the order, in the second 
case / obeys p2 for the order. By switching the variables of /, we can always 
achieve that / obeys pi for the order. 

Lemma 43. Suppose that f : V'^ ^ V is injective and canonical as a 
function from {V; E, to (V; and suppose that it is of type max /pi 

or of type pi/ max^ where i G {1,2}. Then f generates a binary injection of 
type max. 

Proof. Assume wlog that / is of type max /pi, and note that we may assume 
that / obeys pi for the order. Set h{x,y) := f(x,a(y)). Then h behaves 
like Pi on input ^) and like max on input moreover, h(u) ^ h(v) 

iff f{u) f{v), for all u,v ^V'^ with ui ^ vi and U2 ^ V2- We then have 
that g(x, y) := f{f{x, y), y)) is of type max / max, which means that it 
is of type max when viewed as a function from to G. □ 

Lemma 44. Suppose that f : V'^ ^ V is injective and canonical as a 

function from (V; E, to (V; E, and suppose that it is of type min /pi 

or of type pi/ min, where i G {1,2}. Then f generates a binary injection of 
type min. 

Proof. The dual proof works. □ 

Lemma 45. Suppose that f : V'^ ^ V is injective and canonical as a func- 
tion from (V; E, to {V; E, and suppose that it is of type max / min 
or of type min / max. Then f generates a binary injection of type min. 

Proof. Assume wlog that / is of type max / min, and remember that we may 
assume that / obeys pi for the order. Then g{x, y) := /(x, /(x, y)) is of type 



max /pi and generates a binary injection of type max by Lemma 43, □ 

We next deal with the last remaining mixed behavior, pi/p2, by combining 
operational with relational arguments. 

Lemma 46. Let T = (V; E, A^, 7^, . . .) be a reduct of G which is preserved 
by a binary injection of type pi. Then the following are equivalent. 

(1) r has a binary injective polymorphism of behavior min. 

(2) For every primitive positive formula (j) over T, if (j) A N[xi,X2) A 
Ai<i<j<4^i 7^ ^^^0AA^(x3,X4)A/\;L<i<j<4^^ 7^ satisfiable 
over then (j) A A^(xi, X2) A N{xs, X4) is satisfiable over T as well. 

(3) For every finite F CV^ there exists a binary injective polymorphism 
ofT which behaves like min on F. 

Proof. The implication from (1) to (2) follows directly by applying a bi- 
nary injective polymorphism of behavior min to tuples r, s satisfying A 
A^(xi, X2) A Ai<i<j<4 7^ aiid (/) A A^(x3, X4) A Ai<i<j<4 7^ respec- 
tively. 

To prove that (2) implies (3), assume (2) and let F C V'^ he finite. 
Without loss of generality we can assume that F is of the form {ei, . . . , e^}^, 
for sufficiently large n. Let A be the structure induced by F in F^. We 
construct an injective homomorphism h from A to F with the property that 
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for all u^v e F with EN(?i,'z;) or NE(?i,'z;) we have N {h{u) ^ h{v)) . Any 
homomorphism from A to F, in particular /i, can clearly be extended to a 
binary polymorphism of F, for example inductively by using the universality 
of G. Such an extension of h then behaves like min on F. 

To construct /i, consider the formula 00 with variables Xij for 1 < i, j < n 
which is the conjunction over all literals R{xi-^^j-^ , • • • , ^ikdk) such that i? is a 
relation in F and R{ei^^ . . . , e^^) and R{ej^^ . . . , ej^) hold in F. So 00 states 
precisely which relations hold in F^ on elements from F. Since F is preserved 
by a binary injection, we have that 01 := 4^o/\ Ai<ij,k,l<n,(ij)^(k,l) ^hj 
is satisfiable in F. 

Let P be the set of pairs of the form ((^1,^2), (ji, ^2)) with ii,i2,ji,j2 ^ 
{l,...,n}, ii ^ ji, i2 ^ j2, and where iV(e^i,e^J or N{ei^,ej^). We show by 
induction on the size oil Q P that the formula 4>iA/\(^(^-_^ -^^^ ^-^ j2))ei ^(^n,*2 5 ^ 
is satisfiable over F. Note that this statement applied to the set / = P gives 
us the a homomorphism h from A to F such that for all a, 6 E F we have 
N{h{a)^h{b)) whenever EN(a,6) or NE(a,6) by setting h{ei^ej) := ^(xij), 
where s is the satisfying assignment for 0i A A((n,22),0i,i2))GP ^i^h,i2^ ^01,32)- 

For the induction beginning, let p = ((^1,^2), (ji, ^2)) be any element of 
P. Let r, s be the n^-tuples defined as follows. 

r := (ei, . . . , ei, 62, . . . , 62, . . . , e^, . . . , e^) 

s := (ei, 62, ... , en, ei, 62, . . . , e^, . . . , ei, 62, . . . , e^) 

In the following we use double indices for the entries of n^-tuples; for exam- 
ple, r = (ri^i, . . . , ri^n^ '^2,1, • • • , '^n,n)- The two tuples r and s satisfy 0o- To 
see this observe that by definition of 0o the tuple 

((ei, ei), . . . , (ei, e^), (e2, ei), . . . , (e^, e^)) 

satisfies 0o T^; since r and s are projections of that tuple onto the first and 
second coordinate, respectively, and projections are homomorphisms, r and 
s satisfy 0o as well. Let ^ be a binary injective polymorphism of F which is of 
type ^1, and set := ^(r, s) and := g{s^ r). Then and 5' satisfy 0i since 
g is injective. Since p G P, we have that N{ei-^^ej-^) or N{ei^^ej^). Assume 
that N{ei-^,ej-^)] the other case is analogous. Since Vi-^^i^ = ^ii^^jij2 — ^ji^ 
r' := ^(r, 5), and ^ is of type pi, we have that N{r[^ ^2)' pi'oving that 
01 A ^(^ii,z2 5 ^ji,j2) is satisfiable in F. 

In the induction step, let / C P be a set of cardinality n > 2, and assume 
that the statement has been shown for subsets of P of cardinality n — 1. 
Pick any distinct qi,q2 G /. Set 

:= 01 A /\ ^(^n,i2^^ii,i2) 

((^i,^2),(ji,j2))G/\{gi,g2} 

and observe that '0 is a primitive positive formula over F since F contains the 
relations A/" and 7^. Write g'l = ((t^i, 1^2), ('^1, '^2)) and = (('^i? '^2)' ('^i' '^2))- 
Then the inductive assumption shows that each of '0 A N{xu-^^u2^^vi,v2) and 
ijj A N {xy^'^^y^i^^ x^'^^y'^) IS satisfiablc in F. Note that '0 contains in particular 
conjuncts that state that the four variables Xui^u2^^vi,v2^^u'-^,u'^^^v[,v'^ de- 
note distinct elements. Hence, by (2), the formula ip A N{xui^u2^^vi,v2) ^ 
N(xy^f^y^^Xyf^y^) is satisfiable over F as well, which is what we had to show. 
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The implication from (3) to (1) follows from Proposition 36, □ 



Lemma 47. Let f : ^ V be a binary injection of behavior pi/p2 which 
preserves E and N . Then f generates a binary injection of type min and a 
binary injection of type max. 



Proof. By Theorem [32j / generates a binary injection of type max, min, or 
Pi- 

Suppose first that it does not generate a binary injection of type max or 
min; we will lead this to a contradiction. Let F be the reduct of G which has 
all relations that are first-order definable in G and preserved by /. Since / 
generates a binary injection of type pi, we may apply implication (2) (1) 
from Lemma |46j Let be a primitive positive formula with variable set 5, 
{xi, . . . , X4} C such that the formulas A N(xi^ X2) A Ai<i<j<4 7^ 
and (/) A N(xs, X4) A Ai<i<j<4 7^ have in F the satisfying assignments r 
and s from S ^ V, respectively. 

We can assume without loss of generality that r(xi) ^ r{x2) and r{xs) -< 
r(x4); otherwise, since r(xi), . . . , r(x4) must be pairwise distinct, we can 
apply an automorphism of G to r such that the resulting map has the 
required property. Similarly, by applying an automorphism of G to 5, we 
can assume without loss of generality that s(xi) ^ 5(^2) and 5(^4) s(xs). 
Then the mapping t : S ^ V defined by t{x) = f{r(x),s{x)) shows that 
(j) A N{xi^ X2) A N{xs^ X4) is satisfiable in F: 

• The assignment t satisfies since / is a polymorphism of F. 

• We have that N{t{xi),t{x2)) since r(xi) ^ r(x2), s{xi) s{x2), f 
is of type pi on input (^, ^), and N{r{xi),r{x2)). 

• We have that N{t{xs),t{x4)) since r{xs) ^ r{x4), s{xs) y 5(x4), / 
is of type p2 on input (^, and N{s{xs), s{x4)). 



By Lemma [46) we conclude that F is preserved by a binary injection of 
type min, and consequently / generates a binary injection of type min - a 
contradiction. 

Therefore, / generates a binary injection of type max or min. Since the 
assumptions of the lemma are symmetric in E and we infer a posteriori 
that / generates both a binary injection of type max and a binary injection 
of type min. □ 



Having ruled out some behaviors without constants, we now examine 
behaviors when we add constants to the language. In the sequel, we will 
also say that a function f : ^ V has behavior B between two points 
u^v ^V^ if it has behavior B on the structure {u, v}. 

Lemma 48. Let u G , and set U \^ {V \ {ui}) x (y \ {U2}). Let 
f : ^ V be a binary injection which preserves E and behaves like 
pi between all points v,w ^ U, and which behaves like p2 between u and all 
points in U . Then f generates a binary injection of type min as well as a 
binary injection of type max. 

Proof. Let F be the reduct of G having all relations that are first-order de- 
finable in G and preserved by /. Since U contains copies of products of 
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arbitrary finite graphs, / behaves like pi on arbitrarily large finite substruc- 
tures of G^, and hence generates a binary injection of type pi by Proposi- 



tion 36 Hence F is also preserved by such a function, and we may apply 



the implication from (2) to (1) in Lemma 46 to F. 

Let be a primitive positive formula with variable set 5, {xi, . . . , X4} C 5, 
such that 0A7V(xi,X2) AAi<i<j<4^i 7^ xj and 0A7V(x3, X4) A Ai<i<j<4 7^ 
X j are satisfiable over F, witnessed by satisfying assignments r^s : S ^ 
respectively. 

Let a be an automorphism of G that maps r{xs) to ui, and let /3 be an 
automorphism of G that maps s{xs) to U2. Then {a{r(xs)), P(s(xs))) = 
and V := (a(r(x4)), /3(5(x4))) E ?7 since Q:(r(x4)) 7^ a{r{xs)) = ixi and 
/3(5(x4)) 7^ /3(5(x3)) = Thus, / behaves like p2 between u and and 
since s satisfies N(xs, X4), we have that t : S ^ V defined by 

t{x) = f{a{x),/3ix)) 

satisfies A^(x3,X4), too. Since a^/3^f are polymorphisms of F, the assign- 
ment t also satisfies (p. To see that t also satisfies A^(xi,X2), observe that 
a{r{xi)) 7^ a(r(x3)) and /3(s(xi)) 7^ /3(5(x3)), and hencep := (a(r(xi))^ /3(s(xi))) ^ 

Similarly, q := (a(r(x2)), /3(5(x2))) ^ Hence, / behaves as j)i between 
p and and since r satisfies A^(xi,X2), so does t. 



By Lemma 46 we conclude that F is preserved by a binary injection of 
type min, and consequently / generates a binary injection of type min. 

Since our assumptions on / were symmetric in E and it follows that 
/ also generates a binary injection of type max. □ 

Lemma 49. Let u G y^ and set U (V \ {ui}) x (V \ {U2}). Let 

f : ^ V be a binary injection which preserves E and N, behaves like pi 
between all points v^w ^ U, and which behaves like min between u and all 
points in U . Then f generates a binary injection of type min. 

Proof. The proof is identical with the proof in the preceding lemma; note 
that our assumptions on / here imply more deletions of edges than the 
assumptions in that lemma, so it can only be easier to generate a binary 
injection of type min. □ 

Lemma 50. Let u,v ^V^ such that 7^7^(1^, v) and set W := (V\{ui, vi}) x 
(V \ {u2,V2}). Let f ^ V be a binary injection that 

• preserves E and N 

• behaves like pi between all points w^r 

• behaves like pi between u and all points w 

• behaves like pi between v and all points w 

• does not behave like pi between u and v. 

Then f generates a binary injection of type min as well as a binary injection 
of type max. 

Proof. We have to consider the case that EN(ix,i;) and N{f{u)^f{v))^ and 
the case that NE(i/,'u) and E{f{u), f{^))- In the first case we prove that / 
generates a binary injection of type min; it then follows by duality that in 
the second case, / generates a binary injection of type max. 
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As in Lemma 48, we apply the implication (2) (1) from Lemma 46, 
Let r, 0, 5, xi, . . . , X4, r, and s be as in the proof of Lemma |48| by the 
same argument as before, F is preserved by a binary injection of type pi. 

If N(r(xs)^r{x4)), then the assignment r shows that (j) A N(xi^X2) A 
N(xs^X4) is satisfiable and we are done. Otherwise, since r(xs) 7^ ^(x^), 
we have E{r{xs)^r{x4)). Therefore, there is an a G Aut(G) such that 
{a{r{xs)), a(r(x4))) = (ixi, vi). Similarly, since N(s{xs)^ s(x4)) and N{u2^ ^^2), 
there is a /3 G Aut(G) such that {I3{s{xz))^ I3{s{x4))) = (1^2, '^2)- We claim 
that the map t \ S defined by 

t{x) = f{a{x),(3{x)) 

is a satisfying assignment for (j) A N{xi^X2) A A^(x3,X4). The assignment t 
satisfies (j) since and / are polymorphisms of F. Then N{t{x^)^t{x4)) 
holds because (a(r(x3)), /3(5(x3))) = u and (Q^(r(x4)), /3(5(x4))) = and 
N{f{u)^f{v)). To prove that N{t{xi)^t{x2)) holds, observe that r{xi) ^ 
r{x^) and r{xi) ^ ^(^4), and hence a{r{xi)) ^ {0^(^(3:^3)), 0^(^(3:^4))} = 
Similarly, I3{s{xi)) ^ {li{s{x^)), I3{s{xa,))} = {1^2,^2}- Hence, 
(a(r(xi)), G VF. A similar argument for X2 in place of xi shows 
that {a{r{x2))^ I3{s{x2)) G W. Since / behaves like pi between all points of 
VF, and since r satisfies A^(xi,X2), we have proved the claim. This shows 
that F is preserved by a binary injection of type min, and hence / generates 
such a function. 

By symmetry of our assumptions on / in and N ^ it follows that / 
generates a binary injection of type min if and only if it generates a binary 
injection of type max. □ 



We are now set up to prove Proposition 38 , and hence complete the proof 



of Proposition 33 



Proof of Proposition Let / be given. By Theorem [321 / generates a 
binary canonical injection g of type projection, min, or max. In the last two 
cases we are done, so consider the first case. We claim that / also generates 
a (not necessarily canonical) binary injection h of type min or max. Then 
h(g(x^y)^ g(y^x)) is still of type min or max and in addition canonical, and 
the proposition follows. 

To prove our claim, fix a finite set C := {ci, . . . , c^} C V such that the 
fact that / does not behave like a projection is witnessed on C. Invoking 
Proposition [39} we may henceforth assume that / is canonical as a function 
from {V; ^, ci, . . . , c^)^ to (V; -<) (and hence also to {V; E) since tu- 
ples of equal type in {V]E^ ^) have equal type in {V^E)). It is clear that 
this new / must be injective. 

In the following we will consider orbits of elements in the structure {V] E, ^, 
The infinite orbits are precisely the sets of the form 

{i; G y I Qi(v^ Ci) and Ri{v^ q) for all 1 < z < m}, 

for Qi, . . . , Qm ^ {E, N}^ and . . . , Rm G The finite orbits are of 

the form {q} for some 1 < i < m. It is well-known that each infinite orbit 
of (V; ^, ci, . . . , Cjn) contains copies of arbitrary linearly ordered finite 
graphs, and in particular, forgetting about the order, of all finite graphs. 
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Therefore, if / behaves hke min or max on an infinite orbit of (V; 
, ci, . . . , c^), then by Proposition 36 it generates a function which behaves 
hke min or max everywhere, and we are done. Moreover, if / is of mixed type 
on an infinite orbit, then, again by Proposition 36, / generates a canonical 
function which has the same mixed behavior everywhere. But then we are 
done by Lemmas 43, 44, 45, and 47 Hence, we may henceforth assume that 
/ behaves hke a projection on every infinite orbit. Fix in the fohowing an 
infinite orbit O and assume without loss of generality that / behaves like pi 
on O. 

Now suppose that there exists an infinite orbit W such that / behaves like 
P2 between all points u ^ and v G for which ui vi and U2 ^ V2' 
Then fix any v G W^, and set Oi := {o G O | o ^ vi} and O2 := {o G 
O I o ^ V2}. Set 0[ := Oi U {vi} and := O2 U {V2}. We then have that / 
behaves like p2 between v and any point u of (0[ \ {vi}) x (O2 \ {'^2}), and 
like pi between any two points of (0[ \ {vi}) x (O2 \ {'^2})- Since (O^; E, Vi) 
contains copies of all finite substructures of (V;E,Vi)^ for i G {1,2}, by 
Proposition 36 we get that / generates a function which behaves like p2 
between v and any point u of (V \ {vi}) x (V \ {v2})^ and which behaves 



like pi between any two points of (T^ \ {^'i}) x (y \ {'^2})- Then Lemma 48 
implies that / generates a binary injection of type min and we are done. 

This argument is easily adapted to any situation where there exists an 
infinite orbit W such that / behaves like p2 between all points u ^ and 
V G with Ri{ui,vi) and i?2('^2, '^2), for i?i,i?2 ^ {^5^}- 

When there exists an infinite orbit W such that / behaves like min be- 
tween all points u ^ and v G with vi) and i?2('^2, '^2), then we 
can argue similarly, invoking Lemma 48 at the end. Replacing min by max 
we can use the dual argument, with the notable difference that / generates 
a binary injection of type max rather than min. 

Since / is canonical, one of the situations described so far must occur. 
Putting this together, we conclude that for every infinite orbit W and all 
points u G and v G W^, f behaves like pi between u and v. Having 
that, suppose that for an infinite orbit VF, / behaves like p2 on W. Then 
exchanging the roles of O and W and of pi and p2 above, we can again 
conclude that / generates a binary injection of type min. We may thus 
henceforth assume that / behaves like pi between all points u^v ^ (T^ \ C)^. 

Pick any G C^. Suppose that there exists v ^ {V \ C)^ such that 
/ does not behave like pi between u and v; say wlog that EN(i/, v) and 
N{f{u)J{v)). Let Oi be the (infinite) orbit ofvi, for i G {1, 2}. Then for ah 
'L' G Oi XO2 we have EN(i/, v) and N{f{u), f{^)) since / is canonical. Now let 
w G O2XO1. We distinguish the two cases /(it;)) and /(it;)). 

In the first case, / behaves like p2 between u and all v G (Oi U02)^. We can 
then argue as above and are done. In the second case, / behaves like min 
between u and all v G (OiU02)^, and we are again done by the corresponding 
argument above. We conclude that we may assume that for all u ^ and 
all V G (V \C)^, f behaves like pi between u and v as well. 

Now pick u^v ^ such that / does not behave like pi between u and 
say wlog EN(i/,i;) and N{f{u),f{v)); this is possible since the fact that / 
does not behave like pi everywhere is witnessed on C. Pick any 16 infinite 
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orbits Oi,...,Oi6 such that for all Qi,Q2,^i,^2 ^ {E^N} there exists 
w E (OiU - • -UOie)^ with QiQ2{u,w) and RiR2{v,w). Set S^i := {ui,vi}U 
Oi U • • • U Oi6 and 5^2 := {1^2, '^2} U Oi U • • • U Oie- Then Si contains copies 
of all finite substructures of (V; i/^, i;^), for z G {1, 2}, and hence applying 



Proposition 36 to functions from {V-.E^ui^vi) x (1/; iX2, '^2) to {V'.E) we 
see that / generates a function which behaves like p2 between u and like 
pi between u and all points w G {y\{ui^ vi]) x {y\{u2^ '^2}), likepi between 
V and all points w ^ {V\ {ui^ vi]) x (y \ {u2^ '^2}), and like pi between any 
two points w^r ^ (y \ {ui^vi}) x (V \ {u2^ ^2})- But then we are done by 



Lemma [501 D 

8. Algorithms 



We now prove that if one of the Cases (b) to (f ) of Proposition [28] holds for 
a reduct F = {V] E, N,^, . . .) of G with a finite language, then CSP(r) is in 
P. Tractability of Cases (b) and (c) is shown in Subsection 
of Case (d) in Subsection |8.2| of Case (e) in Subsection 
tractability of Case (f) in Subsection |8.4[ 



.1 



8.3 



tractability 
and finally 



8.1. Tractability of types minority / majority with unbalanced pro- 
jections. We show tractability of the CSP for reducts F as in Cases (b) 



and (c) of Proposition 28 



Proposition 51. Let T = (V; E, N,y^, . . .) be a finite language reduct of 
G, and assume that Pol(r) contains a ternary injection of type minority 
or majority, as well as a binary injection which is of type pi and either 
E-dominated or N -dominated in the second argument. Then CSP(r) is 
tractable. 

It turns out that for such F, we can reduce CSP(F) to the CSP of the 
injectivization of F. This implies in turn that the CSP can be reduced to a 
CSP over a Boolean domain. 

Definition 52. A relation is called injective if all its tuples have pairwise 
distinct entries. A structure is called injective if it only has injective rela- 
tions. 

With the goal of reducing the CSP to injective structures, we define in- 
jectivizations for relations, atomic formulas, and structures. 

Definition 53. • Let R be any relation. Then the injectivization of 

i?, denoted by inj(i?), is the largest injective relation contained in 
R. 

• Let . . . , Xji) be an atomic formula in the language of a reduct 
F, where 3y ]^ , . . . , IS a list of the variables that appear in (j). Then 
the injectivization of . . . , Xn) is the formula . . . , Xn)^ 

where i?^^"^ is a relation symbol which stands for the injectivization 
of the relation defined by (j). 

• The injectivization of a relational structure F, denoted by inj(F), is 
the relational structure A with the same domain as F whose rela- 
tions are the injectivizations of the atomic formulas over F, i.e., the 
relations i?^"^. 
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// Input: An instance ^ of CSP(r) with variables V 

While <l> contains a constraint (j) that implies x = y for x^y e V do 

Replace each occurrence of x by in <l>. 

If ^ contains a false constraint then reject 
Loop 

Accept if and only if inj($) is satisfiable in A. 



Figure 2. Polynomial-time Turing reduction from the 
CSP(r) for r closed under an unbalanced binary injection, 
to the CSP of its injectivization A. 

Note that inj(r) also contains the inject ivizations of relations that are 
defined by atomic formulas in which one variable might appear several times. 
In particular, the injectivization of an atomic formula (f) might have smaller 
arity than the relation symbol that appears in (f). 

To state the reduction to the CSP of an injectivization, we also need the 
following operations on instances of CSP(r). 

Definition 54. Let F be a structure in a finite language, A be the injec- 
tivization of F, and $ be an instance of CSP(F). Then the injectivization 
of denoted by inj($), is the instance of CSP(A) obtained from $ by 
replacing each conjunct . . . , Xn) of $ by . . . , x^). 

We say that a constraint (^conjunct) in an instance of CSP(F) is false 
if it defines an empty relation in F. Note that a constraint . . . 

might be false even if the relation R is non-empty (simply because some of 
the variables from xi, . . . , might be equal). 

Lemma 55. Let T be a finite language reduct of G which is preserved by 
a binary injection f of type pi that is E-dominated or N -dominated in the 
second argument. Then the algorithm shown in Figure \^is a polynomial- 
time reduction o/CSP(F) to CSP(A)^ where A is the injectivization ofV. 

Proof. By duality, we may in the following assume that / is E'-dominated 
in the second argument. 

In the main loop, when the algorithm detects a constraint that is false and 
therefore rejects, then $ cannot hold in F, because the algorithm only con- 
tracts variables x and y when x = ^ in all solutions to $ - and contractions 
are the only modifications performed on the input formula So suppose 
that the algorithm does not reject, and let be the instance of CSP(F) 
computed by the algorithm when it reaches the final line of the algorithm. 

By the observation we just made it suffices to show that holds in F if 
and only if inj(^^) holds in A. It is clear that when inj(^^) holds in A then 
^ holds in F (since the constraints in inj(^^) have been made stronger). We 
now prove that if ^ has a solution 5 in F, then there is also a solution for 
inj(^^) in A. 

Let s' be any mapping from the variables of to G such that for all 
distinct variables x, y of we have that 

• iiE{s{x),s{y)) then E{s\x), s\y))', 

• if N{s{x), s{y)) then N{s\x), s\y))] 
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• if s(x) = s{y) then E{s\x)^ s\y)). 
Clearly, such a mapping exists. We claim that is a solution to in F. 
Since must be injective, it is then clearly also a solution to inj(^^). 

To prove the claim, let (f) = . . . , x^) be a constraint in "if. Since 

we are at the final stage of the algorithm, we can conclude that (/) does 
not imply equality of any of the variables xi^ . . . ^Xn^ and so there is for 
all 1 < z < j < n an n- tuple such that R{t^'^^'>) and U 7^ tj hold. 
Since R{xi, . . . , Xn) is preserved by a binary injection, it is also preserved by 
injections of arbitrary arity (it is straightforward to build such terms from a 
binary injection). Application of an injection of arity (2) to the tuples t^*'-^^ 
shows that . . . , Xn) contains an injective tuple t = (ti, . . . , t^). 

Consider the mapping r : {xi, . . . G given by r(xi) := f{s{xi)^ti). 

This assignment has the property that for all 1 < j < if E{s{xi)^ ^{^j)) 
then E{r{xi)^r{xj)), and if N(s{xi), s{xj)) then N{r{xi)^r{xj))^ because / 
is of type pi and because the entries of t are distinct. Moreover, if s{xi) = 
s{xj) then E{r{xi)^r{xj)) because / is £J-dominated in the second argument. 
Therefore, {s'{xi)^ . . . , s'{xn)) and {r{xi)^ . . . , r{xn)) have the same type in 
G. Since / is a polymorphism of F, we have that (r(xi), . . . , r{xn)) satisfies 
the constraint . . . Hence, s' satisfies R{xi^ . . . X as well. In 

this fashion we see that s' satisfies all the constraints of proving our 
claim. □ 

To reduce the CSP for injective structures to Boolean CSPs, we make the 
following definition. 

Definition 56. Let t be a fc-tuple of distinct vertices of G, and let q be (2). 

Then Boole(t) is the g-tuple (ai,2, ^1,3, • • • , ai^k, ^2,3, • • • , cik-i,k) ^ {0, 1}^ 
such that aij = if N(ti^ tj) and aij = 1 if E(ti^tj). If i? is a fc-ary injective 
relation, then Boole(i2) is the g'-ary Boolean relation {Boole(t) \ t e R}. If 
(/) is a formula that defines a relation R over G, then we also write Boole((/)) 
instead of Boole(inj(i?)). Finally, for an injective reduct F, we write Boole(F) 
for the structure over a Boolean domain which has the relations of the form 
Boole (i?), where J? is a relation of F. 

Lemma 57. Let F be a finite language reduct of G which is injective. Then 
CSP(F) can he reduced to CSP(Boole(F)) in polynomial time. 

Proof. Let $ be an instance of CSP(F), with variable set W . We create an 
instance of CSP(Boole(F)) as follows. The variable set of is the set of 
unordered pairs of variables from When (j) — . . . , Xk) is a constraint 

in then ^ contains the constraint 

Boole(7?)(a::i,2, xi^2>, • • • , ^i,/c, ^2,3, • • • , ^/c-i,/c). 

It is straightforward to verify that ^ can be computed from $ in polynomial 
time, and that $ is a satisfiable instance of CSP(F) if and only if * is a 
satisfiable instance of CSP(Boole(F)). □ 

The Boolean majority operation is the unique ternary function / on a 
Boolean domain satisfying f{x^x^y) — f{x^y^x) — f{y^ X , X ^ — X . The 
Boolean minority operation is the unique ternary function / on a Boolean 
domain satisfying f{x, x, y) = f{x, y, x) = f{y, x, x) = y. 
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Lemma 58. Let T be a finite language reduct of G which is injective, and 
suppose it has an polymorphism of type minority (majority). Then Boole(r) 
has a minority (majority) polymorphism, and hence CSP(Boole(r)) can be 
solved in polynomial time. 

Proof. It is straightforward to show that Boole (F) has a minority (majority) 
polymorphism, and well-known (see [25j) that CSP(Boole(r)) can then be 
solved in polynomial time. □ 



Lemmas [551 Pj\ and [58| together provide a proof of Proposition 51 



8.2. Tractability of type minority with balanced projections. We 



move on to reducts as in Case (d) of Proposition 28 



Proposition 59. Let T = (V; E, N,y^, . . .) be a finite language reduct of 
G, and assume that Pol(r) contains a ternary injection of type minority, as 
well as a binary injection which is of type pi and balanced. Then CSP(r) is 
tractable. 

We start by proving that the relations of the reducts under consideration 
can be defined in G by first-order formulas of a certain restricted syntactic 
form; this normal form will later be essential for our algorithm. 

A Boolean relation is called affine if it can be defined by a conjunction of 
linear equations modulo 2. It is well-known that a Boolean relation is affine 
if and only if it is preserved by the Boolean minority operation (for a neat 
proof, see e.g. [15j). 

In the following, we denote the Boolean exclusive-or connective (xor) by 

e. 

Definition 60. A graph formula is called edge affine if it is a conjunction 
of formulas of the form 

xiy^yiW ... V Xky^Vk 

V (i/i ^ vi A • • • A ui ^ vi 

AE{ui,vi)®"'®E{uuVi) =p) 
\/ {ui^viA'" Aui^vi) , 

where p G {0, 1}, variables need not be distinct, and each of k and / can be 
0. 

Definition 61. A ternary operation f \ ^ V \s called straight if for 
every c G the binary operations {x^y) ^ f{x^y,c), {x,z) f{x,c,z)^ 
and (?/, z) f{c, y, z) are balanced injections of type pi. 

We remark that the existence of straight operations and even straight 
minority injections follows from the fact that G contains all countable graphs 
as induced subgraphs. 

Proposition 62. Let R be a relation with a first-order definition over G. 
Then the following are equivalent: 

(1) R can be defined by an edge affine formula; 

(2) R is preserved by every ternary injection which is of type minority 
and straight; 
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(3) R is preserved by some ternary injection injection of type minority, 
and some balanced binary injection of type pi . 

Proof. We first show the imphcation from 1 to 2, that n-ary relations R 
defined by edge affine formulas ^^(xi, . . . 5 Xfi^ are preserved by straight in- 
jections / of type minority. By injectivity of /, it is easy to see that we only 
have to show this for the case that does not contain disequality disjuncts 
(i.e., A: = 0). Now let be a clause from say 

(f) :=(^ui vi A • - - Aui vi 

A (E(ui,vi)®---®E(ui,vi)=p)) 
V (ui = vi A • • • Aui = vi) , 

for p G {0, 1} and ui, . . . ,ui,vi, . . . ,vi G {xi, . . . , Xn}- In the following, it 
will sometimes be notationally convenient to consider tuples in G satisfying a 
formula as mappings from the variable set of the formula to V. Let ^1,^2,^3 : 
{xi, . . . , Xn} V he three mappings that satisfy 0. We have to show that 
the mapping to : {xi, . . . ,x^} V defined by to(x) = /(ti(x), t2(x), t3(x)) 
satisfies 0. 

Suppose first that each of ^1,^2,^3 satisfies ui ^ vi A • • • A ui ^ vi. 
In this case, to(ui) ^ to(vi) A ••• A to(ui) ^ to(vi), since / preserves 7^. 
Note that E{to{ui),to{vi)), for 1 < i < /, if and only if E{ti{ui),ti{vi)) © 
E{t2{ui)^t2{vi)) ® E{ts{ui),ts{vi)) = 1. Therefore, since each ^1,^2,^3 satis- 
fies E{ui, vi) ® • • • ® E{ui, vi) = p^ we find that to also satisfies E{ui^vi) © 
• • • © E{ui^ vi) = p®p®p = p. 

Next, suppose that one of ^1,^2,^3 satisfies Ui = Vi for some (and therefore 
for all) 1 < i < I. By permuting arguments of /, we can assume that ti{ui) = 
ti{vi) for all i G {!,...,/}. Since the function / is straight, the operation 
g : (y^z) f{ti{ui)^y^z) is a balanced injection of type pi. Suppose that 
t2{ui) = t2{vi). Then E{to{ui),to{vi)) if and only if E{t^{ui),t^{vi)), since g 
is balanced. Hence, to satisfies 0. Now suppose that t2{ui) ^ t2{vi). If also 
tsiui) ts(vi), then E{to{ui),to{vi)) if and only if E{t2{ui),t2{vi)) since g is 
of type pi. If on the other hand ts{ui) = ts{vi), then again E{tQ{ui),to{vi)) 
if and only if E(t2{ui),t2{vi)) since g is balanced. In either case, to satisfies 
(/). This shows that / preserves 0, and hence also 

The implication from 2 to 3 is trivial, since every straight injection of type 
minority generates a balanced binary injection of type pi by identification 
of two of its variables. It is also here that we have to check the existence of 
straight injections of type minority; as mentioned above, this follows easily 
from the universality of G. 

We show the implication from 3 to 1 by induction on the arity n of the 
relation R. Let g be the balanced binary injection of type pi, and let h be 
the operation of type minority. For n = 2 the statement of the theorem 
holds, because all binary relations with a first-order definition in G can be 



defined over G by expressions as in Definition [60 

• For X 7^ y we set = 1 and / — 0. 

• For ^£'(x, y) we can set = 0, / = l,p = 0. 

• For ^A^(x, y) we can set A: = 0, / = 1, = 1. 

• Then, E{x^ y) can be expressed as {x ^ y) A ^A^(x, y). 
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• N(x^ y) can be expressed as (x y) A -^E{x^ y). 

• X = y can be expressed as ^E{x^y) A ^N{x^y). 

• The empty relation can be expressed as E{x^ y) A N{x^ y). 

• Finally, V'^ can be defined by the empty conjunction. 

For n > 2, we construct the formula * that defines the relation R{xi^ . . . , Xn) 
as follows. If there are distinct i, j G {1, . . . ,n} such that for all tuples t in 
R we have ti = tj^ consider the relation defined by 3xi.R{xi^ . . . , Xn)- This 
relation is also preserved by g and /i, and by inductive assumption has a 
definition $ as required. Then the formula ^ \— {xi = xj A $) proves the 
claim. So let us assume that for all distinct i, j there is a tuple t E R where 
ti ^ tj. Note that since R is preserved by the binary injective operation 
this implies that R also contains an injective tuple. 

Since R is preserved by an operation of type minority, the relation Boole(inj(i?)) 
is preserved by the Boolean minority operation, and hence has a definition 
by a conjunction of linear equations modulo 2. From this definition it is 
straightforward to obtain a definition . . . of inj(i?) which is the 

conjunction of A2<j<n formulas of the form 

E{ui,vi)®---®E{ui,vi) = 

for 1^1, . . . , u/, I'l, . . . , 'U/ G {^1, . . . , Xji}- It is clear that we can assume that 
none of the formulas of the form E{ui, vi) ® - - - ® E(ui,vi) = j> in $ can be 
equivalently replaced by a conjunction of shorter formulas of this form. 

For all z,j G {!,..., n} with i < j, let Rij be the relation that holds 
for the tuple (xi, . . . . . . iff . . . , x^, x^+i, . . . 

holds. Because Rij is preserved by g and /i, but has arity n — 1, it has 
a definition as in the statement by inductive assumption. We call the 
conjuncts of also the clauses of We add to each clause of a 

disjunct Xi ^ xj. 

Let be the conjunction composed of conjuncts from the following two 
groups: 

(1) all the modified clauses from all formulas 

(2) when = {E{ui^vi) © • • • © E{ui^vi) = ^) is a conjunct of then 
^ contains the formula 

{ui^viA'" Aui^vi A 0) 
y{ui = vi A " ' Aui = vi) . 

Obviously, is a formula in the required form. We have to verify that 
defines R. 

Let t be an n-tuple such that t ^ R. If t is injective, then t violates a 
formula of the form 

E{ui,vi)®-"®E{ui,vi) =p 

from the formula 4> defining inj(i?), and hence it violates a conjunct of 
of the second group. If there are i,j such that U = tj then the tuple 
'•— (^1, • • • 5 U-i^U+i^ • • • 5 ^n) ^ Rij- Therefore some conjunct (f) of is 

not satisfied by and 4>\/ Xi ^ xj is not satisfied by t. Thus, in this case t 

does not satisfy either. 
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It remains to verify that all t ^ R satisfy Let '0 be a conjunct of 
created from some clause in If ti ^ tj, then '0 is satisfied by t because 
(/) contains Xi ^ xj. If ti = tj, then (ti, . . . , t^+i, . . . ^tn) G Ri^j and thus 
this tuple satisfies This also imphes that t satisfies '0. Now, let '0 be a 
conjunct of ^ from the second group. We distinguish three cases. 

(1) For all 1 < i < / we have that t satisfies Ui — Vi. In this case we are 
clearly done since t satisfies the second disjunct of '0. 

(2) For all 1 < i < / we have that t satisfies Ui ^ Vj. Suppose for 
contradiction that t does not satisfy E{ui, vi) ® • • • ® E{ui,vi) — p. 
Let r G i? be injective, and consider the tuple s \— g{t,r). Then 
s G i?, and s is injective since the tuple r and the function g are 
injective. However, since g is of type pi, we have E{s{ui), s{vi)) if 
and only if E{t{ui),t{vi)), for all 1 < z < /. Hence, s violates the 
conjunct E{ui^ vi) • • •®E(ui^ vi) = p from a contradiction since 
s E inj(i?). 

(3) The remaining case is that there is a proper non-empty subset S 
of {1, . . . ,/} such that t satisfies Ui = Vi for all z G 5 and t sat- 
isfies Ui ^ Vi for all z G {1, . . . ,n} \ S. We claim that this case 
cannot occur. Suppose that all tuples f from inj(i?) satisfy that 
^■^g E{ui^Vi) = d for some d E {0,1}. In this case we could 
have replaced E{ui^vi) © ••• © E{ui^vi) — p the two shorter 
formulas ®i^s^^^i^^i) = ^ 0ie{i,...,^}\^ ^i) = p ® d, 
in contradiction to our assumption on $. So, for each d G {0,1} 
there is a tuple G inj(i?) where ^-^^ E{ui^Vi) = d (and thus 
©iG{i,...,n}\^^(^^'^^) = P®d). Now, for the tuple g{t,si-p) we 
have 

^E{u^,v^)= ^E{u^,v^)® i;,) 

ieS ie[n]\S 
= P®{p®{l-p)) 
— \ — p ^ p 

which is a contradiction since g{t^ 8\-^ G inj(i?). 

Hence, alH G i? satisfy all conjuncts t/^ of We conclude that ^ defines 
R. □ 

We now present a polynomial-time algorithm for CSP(r) for the case that 
a reduct F has finitely many edge affine relations. 

Definition 63. Let F be a finite language reduct of G which has only edge 
affine relations, and let $ be an instance of CSP(F). Then the graph of 
$ is the (undirected) graph whose vertices are unordered pairs of distinct 
variables of and which has an edge between distinct sets {a, b} and {c, d} 
if $ contains a constraint whose definition as in Definition [GO] has a conjunct 
of the form 

(^uiy^viA-'-Auiy^viA {E{ui,vi) © • • • © E{ui, vi) = p)) 
V {ui — vi A ' ' ' Aui — vi) 

such that {a, 6} = {ui^ Vi} and {c, d} — {uj^ Vj} for some z, j G {1, . . . , /}. 
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// Input: An instance <l> of CSP(r) with variables V 
Repeat 

For each connected component C of the graph of ^ do 
Let ^ be the affine Boolean formula inj(<l>, C). 
If ^ is unsatisfiable then 
For each {x^y} e C do 

Replace each occurrence of x by in <l>. 
If ^ contains a false constraint then reject 
Loop 

Until inj(<l>,C) is satisfiable for all components C 
Accept 



Figure 3. A polynomial-time algorithm for CSP(r) when 
F is preserved by a straight operation of type minority. 



It is clear that for F with finite signature, the graph of an instance $ of 
CSP(F) can be computed in linear time from 

Definition 64. Let F be a finite language reduct of G which has only edge 
affine relations, and let $ be an instance of CSP(F). For a set C of 2- 
element subsets of variables of $, we define inj($,C) to be the following 
affine Boolean formula. The set of variables of inj($,C) is C. The con- 
straints of inj($, C) are obtained from the constraints of $ as follows. If 
(/) has a definition as in Definition [60] with a clause of the form 

(^ui^viA'-'Aui^viA {E{ui,vi) • • • z;/) = p)) 

V (ui — vi A • • • A ui — vi) 

where all pairs {ui^ Vi} are in C, then inj($, C) contains the conjunct {ui^vi}^ 
• • • [uuvi] 



Proposition [59| now follows from the following lemma and Proposition 62 



Lemma 65. Let T be a finite language reduct of G which has only edge 
affine relations. Then the algorithm shown in Figure solves CSP(F) in 
polynomial time. 

Proof. We first show that when the algorithm detects a constraint that is 
false and therefore rejects in the innermost loop, then $ must be unsatis- 
fiable. Since variable contractions are the only modifications performed on 
the input formula it suffices to show that the algorithm only equates vari- 
ables X and y when x = ^ in all solutions. To see that this is true, assume 
that := inj($, C) is an unsatisfiable Boolean formula for some connected 
component C. Hence, in any solution 5 to $ there must be a {x, y} in C such 
that s{x) = s{y). It follows immediately from the definition of the graph 
of $ that then s{u) — s{v) for all {u^v} adjacent to {x^y} in the graph of 

By connectivity of C, we have that s{u) — s{v) for all {u^v} G C. Since 
this holds for any solution to $, the contractions in the innermost loop of 
the algorithm preserve satisfiability. 

So we only have to show that when the algorithm accepts, there is indeed 
a solution to When the algorithm accepts, we must have that inj($,C) 
has a solution sc for all components C of the graph of Let 5 be a 
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mapping from the variables of $ to the V such that E(xi^Xj) if {xi^xj} 
is in component C of the graph of $ and sc({xi^Xj}) = 1, and N(xi^Xj) 
otherwise. It is straightforward to verify that this assignment satisfies all of 
the constraints. □ 

8.3. Tractability of type majority with balanced projections. We 

turn to reducts as in Case (e) of Proposition [28} 

Proposition 66. Let T = (V; E, N,y^, . . .) be a finite language reduct of 
G, and assume that Pol(r) contains a ternary injection of type majority, as 
well as a binary injection which is of type pi and balanced. Then CSP(r) is 
tractable. 

A Boolean relation is called bijunctive if it can be defined by a conjunction 
of clauses of size at most two (i.e., it is the solution set to a 2SAT instance). 
It is well-known that a Boolean relation is bijunctive if and only if it is 
preserved by the Boolean majority operation (see e.g. [15]). 

Definition 67. A relation i? on G is called graph bijunctive if it can be 
defined in G by a conjunction of disjunctions of disequalities, and of formulas 
of the form 

xi^yiW ... V Xk^yk 

V {ui = Vi AU2 = V2) , 
where X, y G {E, N}, variables need not be distinct, and k can be 0. 

Proposition 68. Let R be a relation with a first-order definition in G. 
Then the following are equivalent. 

(1) R is graph bijunctive; 

(2) R is preserved by every ternary injection which is of type majority 
and straight; 

(3) R is preserved by some ternary injection of type majority and some 
binary balanced injection of type pi . 



Proof. The proof is very similar to the proof of Proposition [62j We first 
show the implication from 1 to 2, that relations that are graph bijunctive 
are preserved by straight injections / of type majority. By injectivity of /, it 
suffices to show this for the case that the formulas do not contain disequality 
disjuncts (i.e.. A: = 0). Since the clauses of such a formula are such 
that Boole((/)) is bijunctive, the claim follows from the fact that bijunctive 
Boolean relations are preserved by the Boolean majority operation in very 



much the same way as in Proposition 62 



For the implication from 2 to 3, observe that straight injections of type 
majority exist since G contains all countable graphs, and that identifying 
two variables of such an operation yields a balanced injection of type pi. 

We show the implication from 3 to 1 by induction on the arity n of the 
relation R. Let g be the balanced binary injection of type pi, and let h be 
the injection of type majority. For n = 2 the statement of the proposition 
holds because all binary relations with a first-order definition over G can be 
defined as in Definition l67l 
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• for y) we can set = 0, X = y := TV, ui — vi := U2 = 
V2 := dually, ^N{x^y) can be defined; 

• For X ^ this is trivial; 

• E{x^y) can be defined as the conjunct oix ^ y and -iiV(x, ?/); dually, 
we can define N{x^y); 

• The relation x = y can be obtained as the conjunction of ^E{x^ y) 
and -iA^(x, ?/); 

• The empty relation is obtained as the conjunction of E{x^y) and 
N{x,y)', 

• Finally, V'^ can be defined by the empty conjunction. 

For n > 2, we construct the formula ^ that defines the relation . . . , Xn) 

as follows. If there are distinct z, j G {1, . . . ,n} such that for all tuples t in 
R we have = tj, consider the relation defined by 3xi.R{xi^ . . . , x^). This 
relation is also preserved by g and /i, and by inductive assumption has a 
definition $ as required. Then the formula \— {xi = xj A $) proves the 
claim. So let us assume that for all distinct i, j there is a tuple t G i? where 
U 7^ tj. Note that since i? is preserved by the binary injective operation 
this implies that R also contains an injective tuple. 

Since R is preserved by a function of type majority, the relation Boole (inj(i?)) 
is preserved by the Boolean majority operation, and hence is bijunctive. 
From this definition it is straightforward to obtain a definition . . . , Xn) 

of inj(i?) which is the conjunction of Ai<2<j<n^^ formulas of 

the form E{u^v)^ N{u^v)^ or 

X{ui,vi) yY{ui,vi) , 

for 1^1, 1^2, '^1 5 '^2 ^ {xi^ . . . ^Xn}^ and X, y G {E^N}. We can assume (by 
removing successively literals from clauses) that this formula is reduced^ i.e., 
that each of the conjuncts is such that removing any of its literals results in 
an inequivalent formula. 

For all i,j G {1, . . . ,n} with i < j, let Ri^j be the relation that holds 
for the tuple (xi, . . . . . . iff R{xi, . . . , x^, x^+i, . . . 

holds. Because also Ri^j is preserved by g and /i, but has arity n — 1, it has 
a definition as in the statement by inductive assumption. We call the 
conjuncts of also the clauses of We add to each clause of a 

disjunct Xi ^ xj. 

Let be the conjunction composed of conjuncts from the following two 
groups: 

(1) all the modified clauses from all formulas ^ij] 

(2) when (f) = {X{ui^vi) Vy(?i25 ^2)) is a conjunct of then contains 
the formula 

{(/) A ui vi A U2 ^ V2) V {ui = vi Au2 = V2) • 

Obviously, is a formula in the required form. We have to verify that 
defines R. 

Let t be an n-tuple such that t ^ R. If t is injective, then since t ^ inj(i?), 
it violates a clause of the form X{ui^vi) V Y{ui^vi) of and hence the 
corresponding clause in If there are i, j such that U = tj then the tuple 

:= (ti, . . . , ti-i^ti^i^ ' " ^tn) ^ Ri^j' Therefore some conjunct (j) of is 
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not satisfied by and V 7^ xj is not satisfied by t. Thus, in this case t 
does not satisfy either. 

It remains to verify that all t G i? satisfy Let '0 be a conjunct of 
created from some clause in If ti ^ tj, then '0 is satisfied by t because 
'0 contains Xi ^ xj. If ti = tj, then (ti, . . . , t^+i, . . . ^tn) G and thus 
this tuple satisfies This also implies that t satisfies '0. Now, let '0 be a 
conjunct of ^ from the second group, so it is of the form 

^Ij ^ {ul^Vl^U2^V2^ {X{ui,Vi) V Y{U2, V2))) 
V {ui = Vi AU2 = V2) . 
We distinguish three cases. 

(1) The tuple t satisfies both ui = vi and U2 = V2' In this case we are 
clearly done since t satisfies the second disjunct of '0. 

(2) The tuple t satisfies ui ^ V2 and U2 ^ V2- Then the argument is 
exactly the same as the argument in the proof of Proposition [62j 

(3) The remaining case is that t satisfies ui — vi and U2 ^ V2 (or 
ui 7^ vi and U2 — V2^ but the proof there is symmetric). We claim 
that this case cannot occur. If t satisfies Y{u2^V2)^ we are done; 
so let us assume that t satisfies -^Y{u2^V2). Since we assumed that 
$ is reduced^ it follows that there exists a tuple s G inj(i?) (and 
hence in R) where -^X{ui^ui) and Y{ui^vi)] otherwise, we could 
have replaced the clause X{ui^vi) V Y{u2^V2) by X{ui^vi). Then 
the tuple r \— g{t^ s) is also injective, and satisfies -^Y(u2^ U2) (since 
g is of type pi) and it also satisfies -^X{ui^vi) (since g is balanced). 
Since g is injective, we have found a tuple r G inj(i?) that does not 
satisfy X{ui^ vi) V Y{ui^ '^1), a contradiction. 

□ 

Combining the following lemma with Proposition l68] gives us a proof of 



Proposition 66 



Lemma 69. Let V he a reduct of G with a finite signature all of whose 
relations are graph bijunctive. Then CSP(r) can be solved in polynomial 
time. 

Proof. The algorithm for CSP(r) is a straightforward adaptation of the 
procedure given in Figure|3j with the difference that instead of affine Boolean 
equation systems we have to solve 2SAT instances in the inner loop. □ 

8.4. Tractability of types max and min. We are left with proving 



tractability of the CSP for reducts F as in case Case (f) of Proposition 28 
i.e., for reducts which have a canonical binary injective polymorphism of 
type max or min. We first observe that we can assume that this polymor- 
phism is either balanced, or of type max and E'-dominated, or of type min 
and A^-dominated. 

Proposition 70. Let T be a reduct of G. If V has a canonical binary 
injective polymorphism of type max^ then it also has a canonical binary 
injective polymorphism of type max which is balanced or E-dominated. If it 
has a canonical binary injective polymorphism of type min, then it also has 
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a canonical binary injective polymorphism of type min which is balanced or 
N -dominated. 

Proof. We prove the statement for type max (the situation for min is dual). 
Let p be the polymorphism of type max. Then h(x^y) := p(x^p(x^y)) 
is not A^- dominated in the first argument; this is easy to see. But then 
p(h(x^y)^h(y^x)) is either balanced or £^-dominated, and still of type max. 

□ 

We will need the following result which was shown in [2, Proposition 14]. 
For a relational structure F, we denote by F the expansion of F that also 
contains the complement for each relation in F. We call a homomorphism 
between two structures F and A strong if it is also a homomorphism between 
f and A. 

Proposition 71. Let F be an cj- categorical homogeneous structure such that 
CSP(r) is tractable, and let A be a reduct ofV. If A has a polymorphism 
which is a strong homomorphism from F^ to then CSP(A) is tractable as 
well. 

In the following, a strong homomorphism from a power of F to F will 



be called strong polymorphism. We apply Proposition [71] to our setting as 
follows. 

Proposition 72. LetT be a reduct of G with a finite signature, and which is 
preserved by a binary canonical injection which is of type max and balanced 
or E -dominated, or of type min and balanced or N -dominated. Then CSP(F) 
can be solved in polynomial time. 

Proof. We have the following. 

• A canonical binary injection which is of type min and A^-dominated 
is a strong polymorphism of {V] —). 

• A canonical binary injection which is of type max and E'-dominated 
is a strong polymorphism of {V] N, =). 

• A canonical binary injection which is of type max and balanced is a 
strong polymorphism of (V; -lE", =). 

• A canonical binary injection which is of type min and balanced is a 
strong polymorphism of (V; ^N, =). 

The tractability result follows from Proposition [TT] because 

CSP{V;E,^E,N,^N,^,^) 

can be solved in polynomial time. One way to see this is to verify that all 
relations are preserved by a straight polymorphism of type majority, and to 
use the algorithm presented in Section |8.3[ □ 

This completes the proof of the dichotomy statement of Theorem [T] 

9. Classification 

We have proven so far that all reducts of the random graph with finitely 
many relations define a CSP which is either tractable or NP-complete. This 
section is devoted to a more explicit description of the border between 
tractable and hard reducts. 
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Clearly, if we add relations to a reduct T, then the CSP of the structure 
thus obtained is computationally at least as complex as the CSP of F. On 
the other hand, by Lemma [3j adding relations with a primitive positive 
definition to a reduct does not increase the computational complexity of 
the corresponding CSP more than polynomially. Therefore, it makes sense 
to call a reduct primitive positive closed if it contains all relations that are 
primitive positive definable from it, and work with such reducts. Observe 
that primitive positive closed reducts will have infinitely many relations, 
and hence do not define a CSP; however, as we have already discussed in 
Section [3| it is convenient to consider a primitive positive closed reduct F 
tractable if and only if every reduct which has finitely many relations, all 
taken from F, has a tractable CSP. 

The primitive positive closed reducts of G form a complete lattice, in 
which the meet of arbitrary set S of reducts is the reduct which has precisely 
those relations that are relations of all reducts in S. Call a primitive positive 
closed reduct maximal tractable if it tractable and any extension of it is not 
tractable anymore. Under the assumption that P does not equal NP, we 
will now list the maximal tractable reducts of G; there are 17 of them. 
Since any chain C of tractable elements of the lattice of primitive positive 
closed reducts is bounded from above by a tractable element (namely, by 
the reduct which has all relations of all members of C), it then follows from 
Zorn's lemma that a reduct of G is tractable if and only if its relations are 
contained in the relations of one of the reducts of our list. 

Recall the notion of a clone from Section |3l It follows from Theorem [4] 
and Proposition [5] that the lattice of primitive positive closed reducts of G 
and the lattice of locally closed clones containing Aut(G) are antiisomorphic 
via the mappings F Pol(F) (for reducts F) and C Inv(C) (for clones 
C). We refer to the introduction of for a detailed exposition of this well- 
known connection. Therefore, the maximal tractable reducts correspond to 
minimal tractable clones, which are precisely the clones of the form Pol(F) 
for a maximal tractable reduct. In the following, we will list the minimal 
tractable clones; the maximal tractable reducts then are those with relations 
Inv(C) for a minimal tractable clone C. 

Definition 73. Let S be a behavior for functions from G^ to G. A ternary 
injection f : ^ V is hyperplanely of type B if the binary functions 
(x,y) 1-^ f(x^y^c), (x^z) f(x^c^z), and (y^z) f{c^y^z) have behavior 
B for ah ceV. 

We have already met a special case of this concept in Definition [61] of 



Section |8.2[ A ternary function is straight if and only if it is hyperplanely 
balanced and of type pi. 

We now define some more behaviors of binary functions which will appear 
hyperplanely in ternary functions in our classification. 

Definition 74. A binary injection f : ^ V is 

• E-constant if the image of / is a clique; 

• N -constant if the image of / is an independent set; 

• of type xnor if for all u^v ^V'^ with ^^{u, v) the relation E{f{u)^ f{^)) 
holds if and only if EE(i/, v) or NN(i/,, v) holds; 
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• of type xor if for all u^v ^V'^ with ^^{u, v) the relation E{f{u), f{^)) 
holds if and only if neither EE(i/, v) nor NN(i/,, v) hold. 

Before stating the theorem that lists the minimal tractable polymorphism 
clones of reducts of G, we observe that if two canonical functions /, g : 
V satisfy the same type conditions, then they generate the same clone. This 
follows easily from the homogeneity of G and by local closure. 

Theorem 75. Assume that P ^ NP. Then the minimal tractable polymor- 
phism clones of reducts of G are precisely those generated by a function of 
one of following 17 types. Moreover , any two functions of the same type 
generate identical local clones, whereas any two functions of distinct type 
generate distinct local clones. 

(1) A constant operation. 

(2) A balanced binary injection of type max. 

(3) A balanced binary injection of type min. 

(4) An E-dominated binary injection of type max. 

(5) An N -dominated binary injection of type min. 

(6) A ternary injection of type majority which is hyperplanely balanced 
and of type projection. 

(7) A ternary injection of type majority which is hyperplanely E -constant. 

(8) A ternary injection of type majority which is hyperplanely N -constant. 

(9) A ternary injection of type majority which is hyperplanely of type 
max and E-dominated. 

(10) A ternary injection of type majority which is hyperplanely of type 
min and N -dominated. 

(11) A ternary injection of type minority which is hyperplanely balanced 
and of type projection. 

(12) A ternary injection of type minority which is hyperplanely of type 
projection and E-dominated. 

(13) A ternary injection of type minority which is hyperplanely of type 
projection and N -dominated. 

(14) A ternary injection of type minority which is hyperplanely of type 
xnor and E-dominated. 

(15) A ternary injection of type minority which is hyperplanely of type 
xor and N -dominated. 

(16) A binary injection which is E-constant. 

(17) A binary injection which is N -constant. 

Proof. In the following, we refer by "Clone x" to the local clone generated 
by a function as in number x of the theorem. 

We first briefly discuss the tractability of these clones: Clone [T] is tractable 
by Proposition [lO) and Clones 2 to 5 are tractable by Proposition [72| The 
clones generated by a function of type majority or minority (Clones 6 to 15) 



are tractable by Propositions [5T| [59] and |66) In those cases, certain binary 
canonical injections of type projection are required - these are obtained by 
identifying the first two variables of the function of type majority / minority, 
and Figure [4] shows which function of type majority / minority yields which 
type of binary injection. We leave the verification to the reader. Finally, let 
/(x,y) be an £^-constant binary injection generating Clone 16, and denote 
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Binary injection type pi 


Type majority 


Type minority 


Balanced 

£^-dominated 

A/'-dominated 

Balanced in 1st, £^-dom. in 2nd arg. 
Balanced in 1st, A^-dom. in 2nd arg. 


Hp. balanced, type pi 
Hp. £^-constant 
Hp. A/'-constant 
Hp. type max, ^-dom. 
Hp. type min, A/'-dom. 


Hp. balanced, type pi 
Hp. type pi, £^-dominated 
Hp. type pi, A/'-dominated 
Hp. type xnor, £^-dom. 
Hp. type xor, A/'-dom. 



Figure 4. Minimal tractable canonical functions of type 
majority / minority and their corresponding canonical bi- 
nary injections of type projection. 



the reduct corresponding to this clone by F. Then ^(x) := /(x, x) is a homo- 
morphism from F to the structure A induced by the image g[V] in F. This 
structure A is invariant under all permutations of its domain, and hence is 
definable in (^[1^]; =); such structures definable by equality only have been 
called equality constraint languages in [4J , and their computational complex- 
ity has been classified. The structure A has a binary injection among its 
polymorphisms (namely, the restriction of / to A). It then follows from the 
results in [4J that CSP(A) is tractable. Hence, CSP(F) tractable as well, 
since F and A are homomorphically equivalent (cf. Proposition [g]). The 
argument for Clone 17 is identical. 

It is not difficult to see and even automatically verifiable that the clones 
are all distinct, and even incomparable (i.e., no clone of the list contains 
another clone of the list) - a task we leave to the reader or his computer. 

We next prove that any tractable clone which contains Aut(G) (i.e., any 
polymorphism clone of a tractable reduct) contains one of the clones of the 
theorem; together with the fact that the clones of the list are incomparable, 
this implies that our list is exhaustive, and that the clones in our list are 
indeed minimal. So suppose that C is any tractable clone, and denote by 



F the corresponding reduct. We apply Proposition 10 If C contains a con- 



stant operation, then it contains Clone [TJ so we may assume that this is not 
the case. Assume next that C contains e^. Then consider the structure A 
induced in F on the image e^[y]. Since F and A are homomorphically equiv- 
alent, CSP(A) is tractable. Since A is definable in the structure (e^fl/];^) 
it follows from the results in [4J that it has a polymorphism which is either 
a constant or a binary injection. The former case is impossible as otherwise 
also F has a constant polymorphism by composing the constant of A with 
eE- Let thus f{x,y) be the binary injection on e[V] which is a polymor- 
phism of A. Then g{x,y) := f{eE{x),eE{y)) is a polymorphism of F. But 
^ is a binary canonical injection which is E'-constant, and hence C contains 
Clone 16 of our list. The argument when C contains is identical. Hence 
by Theorem [TT] it remains to consider the case where the endomorphisms of 



F are generated by its automorphisms. Let Q := Aut(F). By Theorem 13 



there are five possibilities for Q. Suppose first that Q is the group of all 
permutations on V. Then the assumption that C contains no constant oper- 
ation and the tractability of C imply that C contains all binary injections, by 
the results of [4J. In particular, C contains, say, a balanced binary canonical 
injection operation of type max. But then Clone 2 is contained in C. We 
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can therefore assume that G does not contain all permutations. Next con- 
sider the case where G is preserved by the function — : V ^ V but not by 
sw, which implies that F is first-order interdefinable with (V;R^^'^). Then 
the hard relation p(^) consists of only one orbit in F, and hence is violated 
by an endomorphism of F. By our assumption that the automorphisms of 
F generate the endomorphisms, there is also an automorphism of F which 
violates P^^\ Since i?^^) and P^^) are first-order interdefinable by Proposi- 
this implies that the same automorphism violates i?^^^ and hence 



14 



tion 

a relation of F, a contradiction. Now suppose that G contains sw and is a 
proper subgroup of the full symmetric group. Then the hard relation T has 
just one orbit in F, and since T must be violated by a polymorphism of F, it 
is also violated by an endomorphism, and hence also an automorphism of F. 
This is again a contradiction since the functions in G preserve T. It remains 
to consider the case G = Aut(G); this implies that all polymorphisms of F 
preserve E and N, so F contains the relations E and A^. Thus Proposition [28] 
applies, and C contains a binary canonical injection of type max or min, or 
an injection of type minority or majority. If it contains a canonical injection 
of type max or min, then it contains one of the Clones 2 to 5 by Propo- 



sition [70| Otherwise, it contains a ternary injection t of type minority or 
majority, and one of the binary canonical injections of type projection listed 
in Proposition [28]- denote it by p. Set s{x^y^z) t{p{x^y)^p{y^ z)^p{z^x)) 
and w{x^y^z) \— s(p{x,y),p{y, z),p{z^x)). Then the function w has one of 
the behaviors that describe functions generating Clones 6 to 15 - which of 
the behaviors depends on the precise behavior of and is shown in Figure [Ij 
We leave the verification to the reader. □ 

The following is an operational tractability criterion for reducts of G. 

Corollary 76. Let T be a reduct of G. Then: 

• either F has a canonical polymorphism of one of the 17 types listed 



in Theorem \75\ 

and CSP(F) is tractable, or 

• CSP(F) is NP-complete. 

By inspection of all hardness proofs in this paper and the hardness proofs 
in [4], we also obtain the following relational formulation of our result. Let 
Eq be the 6-ary relation defined by 

{(Xi, X2, yi, 1/2, Zi, Z2) G I {Xi = X2 A ^1 ^ ^2 A Zi ^ Z2) 

V {xi ^ X2 A yi = ^2 A zi 7^ Z2) 

V {xi ^ X2 A yi 7^ ^2 A Zi = Z2)} . 

It is easy to see that Pol (£"6) contains precisely the essentially unary oper- 
ations which after deletion of all dummy variables are injective. 

Corollary 77. Let V be a reduct of G. Then: 

• either one of the relations Eq, T , H , or P^^^ has a primitive positive 
definition in V, 

and CSP(F) is NP-complete, or 

• CSP(F) is tractable. 
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Figure 5. The border: Minimal tractable and maximal 
hard clones containing Aut(G). 



Figure [5] shows the border between the hard and the tractable clones. 
The picture contains all minimal tractable clones as well as all maximal 
hard clones, plus some other clones that are of interest in this context. 
When two clones are connected by a line, we do not mean to imply that 
there are no other clones between them which are not shown in the picture. 
Clones are symbolized with a double border when they have a dual clone 
(generated by the dual function in the sense of Definition 31 , whose behavior 
is obtained by exchanging E with N, max with min, and xnor with xor). 
Of two dual clones, only one representative (the one which has E and max 
in its definition) is included in the picture. The numbers of the minimal 
tractable clones refer to the numbers in Theorem [TH "E'-semidominated" 
refers to "balanced in the first and E'-dominated in the second argument" . 

The following corollary summarizes our results about reducts of G in 
purely mathematical terms (without complexity), and is in particular mean- 
ingful even if P=NP. 
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Corollary 78. Let T be a reduct of G. Then precisely one of the following 
holds. 

• One of the relations Eq, T, H, or P^^^ has a primitive positive defi- 
nition in r. 

• r has a canonical polymorphism of one of the 17 types listed in The- 
oremVfl 



We conclude by giving the argument for the decidabihty claim of Theo- 
rem [TJ 

Proposition 79. There is an algorithm which given a finite set ^ of graph 
formulas decides whether or not the problem Graph- SAT ("if ) is tractable. 



Proof. By Corollary 76, the algorithm only has to check whether one of the 



canonical functions in Theorem 75 preserves all formulas ipin'^f. To do 
so it applies the canonical operation to orbit representatives from tuples 
satisfying '0 in all possible ways, and checks whether the result satisfies '0, 
too. □ 

We remark that it follows also from the results in [llj that it is decidable 
whether or not one of the relations in Corollary [77| has a primitive positive 
definition from a given finite language reduct F of G (of which the relations 



are given as graph formulas). This again yields Proposition 79 

Observe that the algorithm in the proof of Proposition 79 even decides 
tractability of Graph-SAT(^^) in polynomial time if the formulas ^ in are 
given as follows: if R is the, say, fc-ary relation defined by in G, then for 
every orbit of fc-tuples in G that is contained in R the representation of i/j 
has a fc-tuple representing this orbit (with the information which relations 
and = hold on the tuple). Now since the operations the algorithm 
has to consider are at most ternary, the number of possibilities for applying 
a canonical function to orbit representatives is at most cubic in the number 
of orbits satisfying '0, which equals the representation size of i(j under this 
assumption. 
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